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This invention was made with Government support under Grant No. POl AG- 
10435 awarded by the National Institute of Aging. The U. S. Federal Government has 
5 certain rights in the invention. 

FTET.n OF THE INVENTION 

The present invention relates to methods in the field of recombinant DNA 
technology, and products related thereto. In a particular aspect, the invention relates to 
O methods for modulating the expression of exogenous genes in mammalian or non- 



1 0 mammalian systems, and products useful therefor, 
jfi BACKGROUND OF THE INVENTION 

It is known in the art to produce fiision proteins for a number of purposes. In 
K some cases, the two protein units in the single polypeptide have two essentially 

ly independent activities. The most common example of this application is the fusion of 

2 15 marking proteins, such as GFP, to intracellular factors as a means of observing their 

localization and expression (see, for example, A.W. Kerrebrock et al. Cell, 51:247- 
56, 1995; H.G. Wang et a/., Cell, 87:629-638, 1996). Creation of fusion proteins has 
also been used to prolong the half-life of a protein (see, for example, R. A. Hallewell, 
et al, J. Biol Chem,, 264:5260-5268, 1989; T.P. Yao et al. Cell, 27:6372, 1992) as 
20 well as other uses (see, for example, T. Sano et aL, Proc Natl Acad Sci U.SA. , 
82:1534-1538, 1992). 

A more complicated application of protein fusion is the production of fusion 
proteins wherein the two protein units cooperate to achieve a biological function. In 
functional dimers, both proteins must fold and interact with each other appropriately. 
25 V.A. Garcia-Campayo et al {Nature Biotech, 15:663-667, (1997)) have utilized a 

peptide linker to fuse gene subunits together into a single biologically active peptide. 
Neuhold and Wold, {Cell, 74:1033-1042, (1993)) have reported the fusion of two 



proteins into a single biologically active protein that binds DNA targets, wherein the 
protein units interact with each other to the exclusion of competing heterodimer 
partners. However, fusion of proteins with multiple functions has been more difficult 
to produce, for example, steroid/thyroid hormone nuclear receptors are complex, 
multifunctional proteins with, minimally, four interconnected yet separable functions: 
ligand binding, dimerization, DNA binding, and transactivation. 

Steroid/thyroid hormone nuclear receptors are used in the field of genetic 
engmeering as a tool for studying control of gene expression and to manipulate and 
control development and other physiological processes. For example, applications for 
regulated gene expression in mammalian systems include inducible gene targeting, 
overexpression of toxic and teratogenic genes, anti-sense RNA expression, and gene 
therapy (see, for example, R. Jaenisch, Science 240:1468-1474, 1988). For cultured 
cells, glucocorticoids and other steroids have been used to induce the expression of a 
desired gene. 

As another means for controlling gene expression in mammalian systems, an 
inducible tetracycline regulated system has been devised and utilized in transgenic mice, 
whereby gene activity is induced in the absence of tetracycline and repressed in its 
presence (see, e.g, Gossen etal PA^^S 52:5547-555 1,1992; Gossen etal, TIBS ISA! 1- 
475, 1993; Furth et aL, PNAS 21:9302-9306, 1994; and Shockett et al, PNAS 
£2:6522-6526, 1995). However, disadvantages of the inducible tetracycline system 
include the requirement for continuous administration of tetracycline to repress 
expression and the slow clearance of antibiotic from bone, a side-effect that interferes 
with regulation of gene expression. While this system has been improved by the recent 
identification of a mutant tetracycline repressor that acts conversely as an inducible 
activator, the pharmacokinetics of tetracycline may hinder its use during development 
when a precise and efficient "on-off' switch is essential (see, e.g., Gossen et a/., Science 
268:1766-1769, 1995). 

Certain insect steroid/thyroid hormone nuclear receptors have also been studied. 
The Drosophila melanogaster ecdysone receptor (EcR) (M. R. Koelle et al, Cell 
62:59-77, 1995) is unlike the estrogen, androgen, and other homodimeric vertebrate 
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steroid hormone nuclear receptors because it requires a heterologous dimer partner for 
functional transactivation. The obligate dimer partner, the product of the 
ultraspiracle (Usp) gene (V. C. Henrich et al, Nuc, Acids Res. 18: 4143-4148, 1990; 
T. P. Yao et aL supra, 1992; T. P. Yao et al. Nature 26^:476-479, 1993), is an insect 
homolog of the mammalian retinoid X receptor (RXR) proteins found in vertebrates 
and other mammalian species. RXRs have been characterized as regulatory dimer 
partners of many mammalian class II steroid/thyroid hormone nuclear receptors, such 
as the thyroid hormone receptors, the retinoic acid receptors, and the vitamin D 
receptor (reviewed in Mangelsdorf and Evans, Ce// 83:841-850, 1995; D. J. 
Mangelsdorf al, Cell 83: 835-839, 1995). RXR is also a dimer partner of EcR. 



Usp and RXR share a significant degree of sequence homology and some 
functional similarities; however, in formation of heterodimers with EcR, RXR 
interacts differently than Usp. One primary difference is that formation of EcR+RXR 
heterodimers is more highly stimulated by the steroid ligand ecdysteroid muristerone 

15 A (murA) than by 20-hydroxyecdysone (20-Ec), while formation of EcR+Usp 

heterodimers is potently stimulated by 20-hydroxyecdysone (K. S.Christopherson et 
al, Proc Natl Acad Sci USA 89:6314-6318, 1982; H. E. Thomas et al, Nature 
3^:471-475, 1993). A second difference is in the way that ligand promotes efficient 
formation of EcR+Usp and EcR+RXR heterodimer complexes and concomitant 

20 binding to ecdysone response elements (EcREs). MurA stimulates EcR+Usp binding 
of EcREs approximately 3 to 7-fold over levels without ligand, but EcR+RXR 
complexes are completely dependent on ligand for heterodimerization. Further 
EcR+RXR complexes bind to EcREs at only 10-40% the level of EcR+Usp 
complexes (Christopherson et aL, supra 1982; Thomas et aL, supra 1993; Yao et al, 

25 supra, 1992 & 1993). This suggests that the affinity of EcR for its natural dimer 
partner, Usp, is significantly greater than its affinity for RXR. 

EcR has been studied for use in transgene regulation; however, its use for this 
purpose is complicated by the requirement for superphysiological levels of RXR 
protein to be coexpressed (No et aL, supra 1997), presimiably because of the 
30 comparatively low affinity of EcR for RXR as a dimer partner. Of the mammalian 
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cell types heretofore examined, only the 293 cell line appears capable of supporting 
high level transactivation of EcR without added RXR (Christopherson et al, supra, 
1982). The requirement for co-expression of RXR in most mammalian systems raises 
concerns that RXR will heterodimerize with endogenous mammalian class II 
5 steroid/thyroid hormone nuclear receptors, causing altered differentiation, growth, or 
fitness of transduced cells. 

A mmiber of ecdysone receptors are known in the art as being a gene sequence 
responsive to an applied exogenous chemical inducer enabling extemal control of 
expression of the gene controlled by the receptor (See, for example, 
PCT/GB96/01195). 

Accordingly, there is a need in the art for improved systems to precisely 
modulate the expression of exogenous genes in mammalian subjects. For example, a 
non-mammalian-based transcription regulating system woxald be extremely desirable for 
general application to transgene regulation in in vitro, ex vivo, and in vivo applications. 
In addition, there is a need in the art for new and better methods of using 
steroid/thyroid hormone nuclear receptors that require a dimer partner for functional 
transactivation of transgene expression for use in somatic gene therapy and for 
laboratory models thereof. 

BRIEF DESCRIPTION OF THE INVENTION 

20 In accordance with the present invention, there are provided chimeric proteins 

comprising at least two functional protein units, wherein each functional protein unit 
comprises the dimerization domain of a member of the steroid/thyroid hormone 
nuclear receptor superfamily, and an optional linker interposed therebetween, wherein 
the at least two protein units form a functional entity. When the chimeric protein 

25 contains two functional protein units, the chimeric protein forms a functional dimer 
(FD), for example a heterodimer or a homodimer. In one embodiment according to 
the present invention, each protein unit comprises a ligand binding domain and an 
optional hinge domain of a steroid/thyroid hormone nuclear receptor member, and an 
optional DNA binding domain. The functionality of the entity is independent of the 
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order of the protein units in the chimeric protein. Polynucleotides encoding the 
invention chimeric protein and cells containing such polynucleotide(s) are also 
provided according to the present invention. In one embodiment according to the 
present invention, the invention polynucleotide encodes the invention chimeric 
5 protein as a fusion protein, w^ith one or more linker(s) encoded as a polypeptide linker. 

In accordance v^ith another embodiment of the present invention, there are 
provided methods for modulating the expression of exogenous gene(s) in a subject 
organism containing DNA construct(s) encoding and expressing invention chimeric 
protein(s) and DNA construct(s) encoding and expressing exogenous gene(s) under 
the control of a response element. The invention method for modulating the 
expression of exogenous gene(s) in a subject organism comprises administering to the 
subject an effective amoimt of an exogenous ligand for at least one functional unit of 
the chimeric protein. 

The present DNA binding studies indicate that many of the invention 
functional dimers (FDs) display DNA binding equivalent or superior to that of 
receptor complexes formed from and/or containing identical wild type members of the 
steroid/thyroid hormone nuclear receptor superfamily (i.e., the same two members 
from which the invention chimeric protein is derived). Transient transfection analysis 
reveals that distinct groups of FD constructs transactivate responsive promoters in a 
manner similar to wild-type complexes, while others lose the capacity to transactivate 
and function like constitutive repressors. 

Competition experiments and supporting data reveal that FDs favor 
dimerization with dimer partners contained within a chimeric protein over interaction 
with other wild type dimer partners. These results demonstrate that certain of the 
25 invention chimeric protein FDs share properties of monomeric receptor complexes 
while others have novel characteristics unique to individual constructs. 

To enhance the possibility of producing a functional entity upon expression, 
the invention chimeric proteins allow for any of the protein units to be positioned at 
the amino terminus of the chimeric protein. In addition, to enhance flexibility for 
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proper folding and three-dimensional orientation of the protein units into a functional 
entity, an optional linker can be interposed between the protein units in the chimeric 
protein. A variety of different linkers can be used in the invention chimeric proteins, 
including chemical and polypeptide linkers, with the latter being preferred if the entity 
5 is expressed as a fusion protein. In a presently preferred embodiment, the linker is 
designed to allow for incremental elongation of the linker distance interposed between 
the two protein units. 

BRIEF DESCRIPTION OF THE FIGURES 

In the interests of brevity and consistency, the names of receptors and dimer 
Q 10 partners have been abbreviated for use herein as follows: "E", "U", or "R" alone 

r indicates a monomeric receptor protein or dimer partner (i.e., not contained in an 

I ^ invention chimeric protein) containing, respectively, at least the ligand binding 

rO domain of the Drosophila ecdysone receptor, the ultraspiracle protein, or the retinoid 

1^' X receptor. When contained within an invention "fusion protein", which is 

=i 15 altematively referred to herein as a "functional dimer", these receptor proteins are 

Q represented by "E", "U", or "R" separated by either an "N", representing a linker of 

^ any length, or a numeral from 0 to 20, indicative of a linker containing a specific 

hU nimiber of linker segments wherein each linker segment contains 12 amino acids. In 

the description of invention fusion proteins, which are functional dimers, the leading 
20 letter in the abbreviation indicates the receptor protein at the amino terminus of the 
ftision protein. For example, E5U means a fusion protein having at least the ligand 
binding domain, hinge domain, and optionally functional DNA binding domain of 
Drosophila ecdysone receptor at the amino terminus, a linker containing 5 linker 
segments (of 12 amino acids each, plus the 5 amino acid linker bridge (i.e., a linker 
25 containing a total of 65 amino acids) and the comparable domains of the ultraspiracle 
protein. An initial "V" in the construct abbreviation, e.g., VE5U or VE, indicates 
fusion of the VP 16 x activation domain to the N-terminus of the fusion protein, as 
described more completely hereafter. 



Figure 1 is a schematic diagram of an nucleic acid construct encoding 
30 invention fusion proteins that contain EcR (darkly shaded) with a dimer partner, U 
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(Usp) or R (RXR) (darkly shaded). "D" = DNA binding domain; "L" = ligand 
binding domain; curvilinear line = fusion bridge. "Individual" represents a nucleotide 
sequence that encodes the wild type C terminus of EcR (receptor) and the monomeric 
N-terminus of RXR (binding partner) before introduction of a nucleotide sequence 
5 encoding a fusion bridge. "Fused" represents the same segments with nucleotides 
inserted that encode a 5 amino acid fusion bridge containing the Sfil insertion site. 
"Tether" indicates a nucleotide sequence that encodes a 12 amino acid linker to be 
inserted into the Sfil site of the fusion bridge to produce fusion proteins with greater 
spacing between the two protein units (i.e., dimer partners) in the invention fusion 
10 protein. 

Figures 2A-B illustrate the results of gel mobility shift assays of response 
element binding of the invention FD constructs having linkers containing either 0 or 5 
linker segments as compared v^th that of monomeric receptor complexes (translated 
in vitro). 

Figure 2A is a graph quantifying gel mobility shift as a result of response 
element binding to invention endodimer FDs in the presence of murA. Controls were 
treated either with vehicle (open bars) or with 1 \xM murA as ligand (black bars). Bars 
are labeled along the bottom with FDs named as described in the text. E represents an 
EcR only control; NON represents a non-transfected control. E+U and E+R are 
control lanes of monomeric in vitro translated proteins used for sizing of endodimer 
band shifts. Numbers at the top of each bar represent relative-fold increase in 
response element binding resulting firom ligand treatment. 

Figure 2B is a schematic representation of five F-domain deletion constructs 
containing EcR (darkly shaded) and RXR (lightly shaded) wdth no linker polypeptide 
25 (EOR). Incremental deletions are shovra to Nhel, PvuII, Narl, and Bglll sites within 
the ecdysone receptor F domain. The top schematic represents EOR (1340 amino 
acids) containing the complete F-domain (bracketed); the second schematic represents 
a deletion (A60 amino acids) to the Nhel site; the third schematic represents a deletion 
(A138 amino acids) to the PvuII site; the fourth schematic represents a deletion (A198 
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amino acids) to the Narl site; and the fifth schematic represents a deletion (A228 
amino acids) to the Bglll site. 

Figure 3 is a graph showing relative luciferase expression induced by FD 
constructs with or without ligand as determined in transient transfection assays for 
5 FDs and for monomeric EcR with either Usp or RXR when treated with vehicle (open 
bars) or 1 |aM murA. Decimal numbers on the abscissa represent molar amount of FD 
relative to VE plasmid (LO is equimolar FD:VE). EOU and UOE without VE 
cotransfection are at the extreme right of the bar graph. See also Table 1 . 

Figures 4A-B are two graphs illustrating the results of transient transfection 
assays conducted using either VP16-fused monomeric receptors or invention fusion 
protein FDs with increasing linker lengths. Figure 4A is a graph showing a 
comparison of luciferase activity in relative light units (RLU) in transient transfection 
assays conducted with or without ligand, using either monomeric receptors having 
amino terminal fused VP 16 activation domains or invention FDs containing EcR, 
RXR, and a linker with a variable number of linker segments. Cells were treated with 
vehicle (open bars), or 1 jiM muristerone A as ligand (black bars). Nimibers at the 
top of the bars indicate the fold-increase relative to FD or monomeric receptor 
without addition of monomeric VRXR (VR) or monomeric VUsp (VU). E = EcR 
only; E4-luc = reporter plasmid only; and Figure 4B is a graph showing a comparison 
of luciferase activity as in Figure 4A herein, except that the FDs contain Usp in place 
of RXR. 

Figure 5 is a series of three graphs showing repression of ligand-stimulated 
luciferase expression by monomeric receptors caused by competition with the 
invention ENU and UNE FDs when transiently co-transfected into 293 cells and 
25 treated either with vehicle (open bars), or 1 \iM muristerone A as ligand (black bars). 
Decimal numbers on the abscissa represent molar amoxmt of FD relative to VE 
plasmid (1 .0 is equimolar FD:VE). Figure 5A shows a comparison of the inhibitory 
effects of EOU and UOE on ligand-stimulated expression of luciferase by monomeric 
VE in combination with endogenous RXR. EOU and UOE without VE cotransfection 
30 are at the extreme right of the bar graph; Figure 5B shows the effects of monomeric 
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EcR (without VP 16 fusion) on ligand-stimulated expression of luciferase in the assay 
of Figure 5 A by competition with EOU or UOE; and Figure 5C shows the effects of 
monomeric EcR on luciferase expression in the presence of ligand in the assay of 
Figure 5B, as compared with VE combined with monomeric exogenous Usp. 

Figure 6 is a graph showing a comparison of results (in RLU) obtained in 
assays in which E5U and E5R compete with monomeric VRXR (VR) or monomeric 
VUsp (VU) in 

the presence of vehicle only (open bars) or murA as ligand (black bars). FDs and 
receptor combinations are labeled along the abscissa. Numbers above the bars 
represent the fold-increase relative to FD or receptor without addition of VR or VU. 
E4LUC at the extreme right is reporter plasmid alone as control. 

Figures 7A-E are a series of six schematic diagrams representing possible 
conformations of receptor FDs described in the text. Shaded and white 
oval/rectangles represent receptors, small rectangles with interior arrows represent 
EcREs, and curvilinear lines represent linkers between protein units in the invention 
fusion proteins. Figure 7 A represents a native dimer; Figure 7B represents a 
disorganized fusion protein; Figure 7C represents an endodimer orientation of a 
single invention FD; Figure 7D represents a tetramer of two invention FDs; Figure 
7E represents a multimer of four invention FDs. 

20 Figure 8 is a series of schematic representations of invention FDs containing 

Bombyx ecdysone receptor (BEcR) plus the entire F domain of the Drosophila 
melanogaster ecdysone receptor (amino acids 650 to 878) (DE), which segment is 
included for ease in making the construct (DEcR). The BEcR is at the amino 
terminus of the fusion protein with either RXR or Usp as the dimer partner at the 

25 carboxy terminus of the fusion protein. "D" = DNA binding domain; "L" = ligand 
binding domain; curvilinear line = fusion bridge. "H" = an N-terminal His tag for 
protein purification. 
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DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the present invention, there are provided chimeric proteins 
comprising at least two functional protein units, wherein each functional protein unit 
comprises the dimerization domain of a member of the steroid/thyroid hormone 
nuclear receptor superfamily, and an optional linker interposed therebetween, wherein 
the at least two protein xmits form a functional entity. When the chimeric protein 
contains two functional protein units, the chimeric protein forms a functional dimer 
(FD), for example a heterodimer or a homodimer. 

The invention chimeric proteins form functional entities (e.g. functional 
dimers) imder a variety of conditions. Such conditions include, but are not limited to, 
those at or near physiological conditions (e.g., in saline at body temperature). Those 
of skill in the art will understand that formation of invention functional entities by 
dimerization or crystallization of a macromolecule can be influenced by manipulation 
of a variety of physical parameters, such as are disclosed in McPherson, Eur. J, 
Biochem.^ 189 :1-23, 1990, which is incorporated herein by reference in its entirety. 
Due to the proximity of the protein units within the invention chimeric protein, 
dimerization tends to take place with intramolecular partners, rather than with other 
suitable monomeric dimer partners with which the protein units in the chimeric 
protein might otherwise interact. 

As used herein, plural nouns and verbs are intended to signify the singular 
form as well as the plural form of the particular noim or verb, unless prefixed by an 
adjective indicating a specific number, such as "two feet" or "three ligands", and a 
singular noun or verb is intended to include the plural form, unless prefixed by a 
phrase clearly indicating that only the singular noun or verb is intended, as in the 
phrase "one and only one foot" or "only one ligand." 

Each chimeric protein in the invention system is required to contain a 
dimerization domain of a member of the steroid/thyroid hormone nuclear receptor 
superfamily. As used herein, "dimerization domain" meems a region of a member of 
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the steroid/thyroid hormone nuclear receptor superfamily containing a sequence of 
amino acids that functions to cause dimerization of two members of the 
steroid/thyroid hormone nuclear receptor superfamily. Members of the steroid/thyroid 
hormone nuclear receptor superfamily are commonly characterized by the presence of 
5 five domains: N-terminal or activation domain (A/B), DNA binding domain (C), hinge 
domain (D), ligand binding domain (E), and C-terminal domain (F) (Evans, R. Science 
240:889-895, 1988). The dimerization domain is generally located within the region 
of the receptor molecule that is referred to as including the D, E and F domains, or is 
referred to as the "D-E-F" domain. Typically the dimerization domain includes the 
10 complete ligand binding domain (E) and may optionally include all or part of the 
§3 hinge domain (D) and/or the C-terminal region (F) of a member of the steroid/thyroid 

nuclear receptor superfamily, or a functional equivalent thereof. In some cases the 

—J 

m dimerization domain may include at least a portion of the DNA binding domain itself. 

\q Multiple domains of a given receptor can act in concert as well as independently. 

15 Therefore, as employed herein, the term "dimerization domain of a member of the 
Steroid/thyroid hormone nuclear receptor superfamily" refers to that portion (or 
12 portions) of a member of the steroid/thyroid hormone nuclear receptor superfamily 

1^ that is involved in the formation of a dimer. 

iO As used herein, the term "fusion protein" means a genetically engineered 

20 molecule in which two or more polypeptide units are fused into a single polypeptide 
molecule by fusion of the open reading frames (ORFs) encoding the two or more 
separate protein units into a single ORF. The invention fusion proteins are capable of 
forming a "functional entity" in the optional presence of ligand. When the fusion 
proteins contain two protein units, a "functional dimer (FD)" is formed by 
25 dimerization. 

As used herein, the term "functional dimer" or "functional entity" as applied 
to an invention chimeric protein means that the functional entity or dimer possesses at 
least some of the biological function of a dimer formed between two equivalent 
monomeric (i.e. xmdimerized) polypeptide units, or between two equivalent 
30 monomeric members of the steroid/thyroid hormone nuclear receptor superfamily. 
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The biological function of such dimers includes one or more of the following 
properties: DNA binding, ligand binding, transactivation, and dimerization properties 
related to transactivation of a promoter operatively associated with a response element 
responsive to the invention chimeric protein. For example, invention chimeric 
protein(s) can modulate transactivation of gene(s) whose expression is controlled by the 
presence of ligand (e.g. an invention FD wherein at least one member is a Bombyx mori 
ecdysone receptor can modulate the expression of a gene under the control oi 3, Bombyx 
ecdysone response element). 

Therefore, the term "functional protein units" as applied to the functional 
entity or dimer formed by an invention chimeric protein means that the at least two 
protein xanits in the functional entity or dimer possess a cooperative function. For 
example, in a functional dimer the two dimerization domains (e.g., the two protein 
units) fold and interact with each other in a manner appropriate to substantially 
preserve one or more of the above named biological functions in the functional dimer 
that are present when corresponding monomeric members of the dimer come together 
under physiological conditions to form a native dimer complex. 

As used herein the term "endodimer" means a dimer formed in an orientation 
approximating that of a native dimer complex formed between equivalent monomeric 
polypeptides, i.e., an "intemal" dimer. Figure 7A illustrates a native dimer complex 
and Figure 7C illustrates an invention endodimer. 

As used herein the term "dimer partner" means any polypeptide that, xinder 
physiological conditions, forms a dimer with a member of the steroid/thyroid 
hormone nuclear receptor superfamily. Such dimer partners include, but are not 
limited to, monomeric member(s) of the steroid/thyroid hormone nuclear receptor 
superfamily, including those known in the art as a "silent partner," which are 
characterized by forming dimeric species with a member of the steroid/thyroid 
superfamily of receptors wherein the silent partner may not directly participate in 
binding ligand (i.e., only the co-partner in the fusion protein binds ligand). Exemplary 
dimer partner(s) include RXR, Usp, Nurrl, and the like. 
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The term "dimer partner" is meant to include members of the steroid/thyroid 
hormone nuclear receptor superfamily to which other wild type members preferentially 
bind to form heterodimeric species. For example, wild type members of the 
steroid/thyroid hormone nuclear receptor superfamily preferentially form heterodimers 
5 with a common partner, the retinoid X (or 9-cis retinoic acid) receptor (RXR, see, for 
example, Yu etal, Cell 62:1251-1266, 1991; Bugge etaL,EMBOJ,, 11:1409-1418, 
1992; Kliewer et aL, Nature 155:446-449, 1992; Leid et al. Cell 68:377-395, 1992; 
Marks etal.EMBOJ. 11:1419-1435, 1992; Zhang a/., iVafwre 255:441-446, 1992; 
Issemann et al, Biochimie, 25:251-256, 1993). Additional dimer partners for members 
10 of the steroid/thyroid hormone nuclear receptor superfamily include ultraspiracle (Usp), 
famesoid X receptor (FXR), and the like. 

As used herein, the phrase "member(s) of the steroid/thyroid hormone nuclear 
receptor superfamily" (also known as "intracellular receptors" or "the nuclear receptor 
superfamily") refers to hormone binding proteins that operate as Ugand-dependent 
15 transcription factors, including identified members of the steroid/thyroid hormone 
nuclear receptor superfamily for which specific ligands have not yet been identified 
(referred to in the art as "orphan receptors"). 

Exemplary members of the steroid/thyroid hormone superfamily of receptors 
(including the various isoforms thereof) include steroid receptors such as 

20 glucocorticoid receptor (GR), mineralocorticoid receptor (MR), estrogen receptor 
(ER), progesterone receptor (PR), androgen receptor (AR), vitamin D3 receptor 
(VDR), and the like; plus retinoid receptors, such as the various isoforms of retinoic 
acid receptor (e.g., RARa, RARp or RARy), the various isoforms of retinoid X (or 9- 
cis retinoic acid) receptor (e.g., RXRa, RXRp, or RXRy), various isoforms of 

25 peroxisome proliferator-activated receptors (e.g., PPARa, PPARy, PPAR6) and the 
like (see, e.g., U.S. Patent Nos. 4,981,784; 5,171,671; and 5,071,773); thyroid 
hormone receptor (T3R), such as TRa, TRp, and the like; steroid and xenobiotic 

receptor (SXR, see for example, Blumberg et al.. Genes Dev (1998) 
12(20}:3 195-205), RXR-interacting proteins (RIPs; see, e.g., Seol et al., Mol 
30 Endocrinol (1995) 2a}:72-85; Zavacki et al., Proc Natl Acad Sci USA (1997) 
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94ri5V 7909-14^ including famesoid X receptor (FXR; see for example, Forman et al.. 
Cell (1995) 8U5V 687-93: Hanley et d\.,JClin Invest (1997) 100r3^ :705-12, O'Brien 
et al.. Carcinogenesis (1996) rZ£2): 185-90), pregnenolone X receptor (PXR; see for 
example, Schuetz et al., Mol Pharmacol (1998) 54(6}:1 1 13-7), liver X receptor (LXR, 
see, e.g., Peet et al., Curr Opin Genet Dev (1998) Si;5):571-5), BXR (Blumberg et al.. 
Genes Dev (1998) 12(9): 1269-77), insect derived receptors such as the ecdysone 
receptor (EcR), the ultraspiracle receptor (see, for example, Oro et al., in Nature 
342:298-301 (1990)), and the like; as well as other gene products which, by their 
structure and properties, are considered to be members of the superfamily, as defined 
hereinabove, including the various isoforms thereof (see, e.g., Laudet, V., J Mol 
Endocrinol (1997) 12(3}:207-26). 

Examples of orphan receptors contemplated for use herein include HNF4 (see, 
for example, Sladek et al.. Genes & Development 4:2353-2365 (1990)), the COUP 
family of receptors (see, for example, Miyajima et al., in Nucleic Acids Research 
1^:1 1057-1 1074 (1988), and Wang et al.. Nature 340:163-166 (1989)), COUP-like 
receptors and COUP homologs, such as those described by Mlodzik et al.. Cell 
6Q:21 1-224 (1990) and Ladias et al.. Science 211:561-565 (1991), orphan receptor 
(OKI; see, e.g., Feltkamp et al., J Biol Chem (1999) 224115): 1042 1-9), the insect 
derived knirps and knirps-related receptors, short heterodimer partner (SHP; see, e.g., 
Seol et al., Mol Cell Biol (1997) 12112): 7 126-3 1), hepatocyte nuclear receptor 4 
(HNF4), constitutive androstane receptor (CAR; see, e.g., Forman et al.. Nature 
(1998) 395(6702^ :612-5^ and the like. 

Each protein unit in the invention chimeric protein is required to contain at least 
a dimerization domain, optionally, the entire ligand binding domain, an optional hinge 
domain, and an optionally functional DNA binding domain of a member of the 
steroid/thyroid nuclear receptor superfamily, or a functional equivalent thereof. For 
use in the invention methods for modulating the transcription of exogenous or 
endogenous nucleic acids in a host, the ligand binding domains are either endogenous or 
non-endogenous to a host, with the latter including ligand binding domains that are 
modified to be non-responsive to ligands endogenous or native to the host. In 
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embodiments wherein the ligand binding domain is derived from non-mammalian 
member(s) of the steroid/thyroid horaione nuclear receptor superfamily, which members 
are not normally present in the cells of a host, the ligand binding domains are preferably 
derived from the carboxy-terminal portion of non-mammalian members. Exemplary 
5 members that are not normally present in mammalian cells include insect, avian, 

amphibian, reptilian, fish, plant, bacteria, viral and fimgal (including yeast) members of 
the steroid/thyroid hormone nuclear receptor superfamily, and the like. 

Exemplary ligand binding domains derived from insect receptors include those 
derived from lepidopteran species such as Drosophila melanogaster (M.R. Koelle, 
1995), Bombyx mori (Swevers et al, Insect Biochem. Molec. Biol, 25(7):857-866, 
1995), Choristoneura fumiferana (PalU et al, Insect Biochem, Molec, Biol, 26(5):485- 
499, \996\ Manduca sexta (¥n]vwdirdi et al, Insect Biochem, Molec, Biol, 25(7):845- 
856, 1995), Aedes aegypti (Cho et al. Insect Biochem Molec, Biol, 25: 19-27, 1995), 
Chorinomus tentans (Imhof et al. Insect Biochem, Molec. Biol, 25:115-124, 1993), and 
the like. 

When the fimctional protein units included in the invention chimeric protein lack 
a substantial portion of the C-terminal "F" domain in the dimerization domain of a native 
member of the steroid/thyroid hormone nuclear receptor superfamily, a functional 
protein iinit that is less than about 700 amino acids in length is provided. 

Ligand binding domains can be ftinctionally located in either orientation and at 
various positions within the protein unit. For example, the ligand binding domain can be 
positioned at either the amino or carboxy terminus of the protein unit in the invention 
chimeric protein, or therebetween. In a preferred embodiment of the present invention, 
25 the ligand binding domain is positioned at the carboxy terminus of the protein unit (see 
Figure 1). 

The optional hinge region, when present, can also be functionally located in 
either orientation and at various positions within the protein unit. For example, the hinge 
region can be positioned at either the amino or carboxy terminus of the protein unit, or 
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therebetween. Preferably, the hinge region is positioned internally between the ligand 
binding and DNA binding domains of one or more of the members in the chimeric 
protein. The hinge region bounded by the ligand binding domain and DNA binding 
domain of the native Bombyx mori receptor (BEcR), specifically, about 27 amino acid 
5 residues (i.e. amino acid residues 283-309, in the hinge region of BEcR) are sufficient to 
confer high affinity for complex formation with an endogenous dimer partner (see U.S. 
Patent Application Serial No. 08/891,298, filed July 10, 1997, copending herewith). 

Each protein unit in the invention chimeric protein also optionally contains a 
DNA binding-domain. DNA-binding domains contemplated for use in the preparation 
of invention chimeric proteins are well known in the art and are typically obtained from 
DNA-binding proteins (e.g., transcription factors). The term "DNA-binding domain" is 
understood in the art to refer to an amino acid sequence that is able to bind to DNA. As 
used herein, the term "DNA-binding domain" encompasses a minimal peptide sequence 
of a DNA-binding protein up to the entire length of a DNA-binding protein, so long as 
the DNA-binding domain functions to associate with a particular regulatory element. 

DNA-binding domains are known to function heterologously in combination 
with other fimctional domains by maintaining the ability to bind the natural DNA 
recognition sequence (see, e.g.. Brent and Ptashne, CelU 43:729-736, 1985). For 
example, v^th respect to steroid/thyroid hormone nuclear receptors, DNA-binding 
20 domains are interchangeable, thereby providing numerous chimeric receptor proteins 

(see, e.g., U.S. Patent 4,981,784; and R. Evans, Science, 240:889-895, 1988). Similar to 
the ligand binding domain, the DNA-binding domain can be positioned at either the 
carboxy terminus or the amino terminus of a protein unit in the invention chimeric 
protein, or the DNA-binding domain can be positioned between the ligand binding 
25 domain and the activation domain. In preferred embodiments of the present invention, 
the DNA-binding domain is positioned internally between the ligand binding domain 
and the activation domain. 

"DNA-binding protein(s)" contemplated for use herein belong to the well-known 
class of proteins that are able to directly bind DNA and facilitate initiation or repression 
30 of transcription. Exemplary DNA-binding proteins contemplated for use herein include 
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transcription control proteins (e.g., transcription factors and the like; see, for example, 
Conaway and Conaway, Transcription Mechanisms and Regulation^ Raven Press Series 
on Molecular and Cellular Biology, Vol. 3, Raven Press, Ltd., New York, NY, 1994). 

Transcription factors contemplated for use herein as a source of such DNA 
binding domains include, e.g., homeobox proteins, zinc finger proteins, hormone 
receptors, helix-tum-helix proteins, helix-loop-helix proteins, basic-Zip proteins (bZip), 
p-ribbon factors, and the like. See, for example, S. Harrison, "A Structural Taxonomy of 
DNA-binding Domains," Nature, 253:715-719. Homeobox DNA-binding proteins 
suitable for use herein include, for example, HOX, STF-1 (Leonard et al, Mol Endo. , 
2:1275-1283, 1993), Antp, Mat a-2, INV, and the like. See, also, Scott et al Biochem. 
Biophys. Acta, 282:25-48, 1989. It has been found that a fragment of 76 amino acids 
(corresponding to amino acids 140-215 described in Leonard et al, 1993) containing the 
STF-1 homeodomain binds DNA as tightly as wild-type STF-1 . Suitable zinc finger 
DNA-binding proteins for use herein include Zif268, GLI, XFin, and the like. See also, 
Klug and Rhodes, Trends Biochem. 5c/., 12:464, 1987; Jacobs and Michaels, New Biol, 
2:583, 1990; and Jacobs, ^MBOJ;, 11:4507-4517, 1992. 

An additional DNA binding domain contemplated for use in the practice of the 
present invention is the GAL4 DNA binding domain. The DNA binding domain of the 
yeast GAL4 protein comprises at least the first 74 amino terminal amino acids thereof 
(see, for example, Keegan et al. Science 231:699-704, 1986). Preferably, the first 90 or 
more amino terminal amino acids of the GAL4 protein will be used, for example, the 
147 amino terminal amino acid residues of yeast GAL4 . 

The DNA-binding domain(s) used in the invention chimeric proteins can be 
obtained from a member of the steroid/thyroid hormone nuclear receptor superfamily, or 
25 are substantially the same as those obtained from a member of the superfamily. The 
DNA-binding domains of all members of the steroid/thyroid hormone nuclear receptor 
superfamily are related. Such domains consist of 66-68 amino acid residues, and possess 
about 20 invariant amino acid residues, including nine cysteines. Members of the 
superfamily are characterized as proteins which contain these 20 invariant amino acid 
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residues. The highly conserved amino acids of the DNA-bmding domain of members of 
the superfamily are as follows: 

Cys - X - X - Cys - X - X - Asp* - X - Ala* - X - Gly * - 
X - Tyr* - X - X - X - X - Cys - X - X - Cys - Ly s* - X - 
Phe - Phe - X - Arg* - X - X - X - X - X -(X - X -) Cys - 
X -X - X - X - X - (X - X - X -) Cys - X -X - X - Lys - X - 
X - Arg - X - X - Cys - X - X - Cys - Arg* - X - X - 
Lys* - Cys - X - X - X - Gly* - Met (SEQ ID NO: 1); 

wherein X designates non-conserved amino acids within the DNA-binding domain; an 
asterisk denotes the amino acid residues which are almost universally conserved, but for 
which variations have been found in some identified hormone receptors; and the residues 
enclosed in parenthesis are optional residues (thus, the DNA-binding domain is a 
minimum of 66 amino acids in length, but can contain several additional residues). 

Invention chimeric proteins are optionally modified by the introduction of an 
activation domain subunit. Activation domains contemplated for use in the practice of 
the present invention are well known ui the art and can readily be identified by those of 
skill in the art. Such activation domains are typically derived fi-om transcription factors 
and comprise a contiguous sequence that fiuictions to activate gene expression when 
associated v^th a suitable DNA-binding domain and a suitable ligand binding domain. 
An activation domain can be positioned at any convenient site v^thin tiie invention 
chimeric protein, e, g., at the carboxy terminus, the amino terminus, or between the 
ligand binding domain and the DNA bmding domain within one or both protein units of 
the chimeric protein. In presently preferred embodiments of the invention, the activation 
domain is positioned at the amino terminus of the invention chimeric protein. 

Suitable activation domains can be obtained fi-om a variety of sources, e.g., from 
the N-terminal region of members of the steroid/thyroid hormone nuclear receptor 
superfamily, fi:'om transcription factor activation domains, such as, for example, VP 16, 
GAL4, NF-kB or BP64 activation domains, and the like. The activation domain 
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presently preferred for use in the practice of the present invention is obtained from the C- 
terminal region of the VP 16 protein, and is known as VP16t. 

In a presently preferred embodiment of the present invention, chimeric 
proteins contain one or more ecdysone receptors (EcR) as the steroid/thyroid hormone 
nuclear receptor, for example, a Drosophila EcR (DEcR) or a Bombyx EcR (BEcR). 
The chimeric protein further comprises either RXR or ultraspiracle protein (Usp) as 
an additional functional protein unit. The preferred order Mdthin the chimeric protein 
is for the EcR to be located at the amino terminus of the chimeric protein. However, 
when the invention chimeric protein further comprises an activation domain, the 
activation domain is preferably located at the amino terminus of the chimeric protein. 

The EcR, an insect receptor, differs in two respects from other known 
steroid/thyroid hormone nuclear receptor superfamily. First, EcR has very different 
docimiented relationships with two similar dimer partners: its natural partner, Usp, 
and the mammalian homolog, RXR. EcR also differs from other members of the 
steroid/thyroid hormone nuclear receptor superfamily in that its apparent affinity to 
these heterodimer partners varies depending on the presence of ligand. 

To facilitate dimerization of the dimerization domains in the invention 
chimeric proteins, the at least two protein imits of the chimeric protein preferably 
have the ligand binding domain, hinge domain, and DNA binding domain in the same 
order within each protein xmit. If the chimeric protein additionally contains an 
activation domain, the activation domain is preferably located at the amino terminus 
of the chimeric protein, ahead of the first unit thereof, as illustrated in Examples 1 and 
4 herein. 

Invention chimeric protein(s) optionally further contain a linker interposed 
between one or more of the protein units. The protein units can be independently 
oriented amino terminus to carboxy terminus within the chimeric protein, or visa 
versa . For example, the linker can be placed between the carboxy terminus of the 
first protein unit and the amino terminus of the second protein unit. Any type of 
linker known in the art can be used for linking the protein imits in invention chimeric 
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proteins so long as the linker is flexible and does not interfere with dimerization 
between protein units in the invention chimeric proteins. 

In one embodiment according to the present invention, the linker is a 
heterobifunctional cleavable cross-linker, such as N-succinimidyl (4-iodoacetyl)- 
5 aminobenzoate; sulfosuccinimidyl(4-iodoacetyl)-aminobenzoate; 4-succinimidyl- 
oxycarbonyl-a-(2-pyridyldithio) toluene ; sulfosuccinimidyl-6-[a-methyl-a- 
(pyridyldithiol)-toluamido] hexanoate; N-succinimidyl-3-(-2-pyridyldithio)- 
proprionate; succinimidyl-6-[3(-(-2-pyridyldithio)-proprionamido] hexanoate; 
sulfosuccinimidyl-6-[3(-(-2-pyridyldithio)-propionamido] hexanoate; 3-(2- 

10 pyridyldithio)-propionyl hydrazide, EUman's reagent, dichlorotriazinic acid, S-(2- 
thiopyridyl)-L-cysteine, and the like. Further bifunctional linking compounds are 
disclosed in U.S. Patent Nos. 5,349,066. 5,618,528, 4,569,789, 4,952,394, and 
5,137,877, each of which is incorporated herein by reference in its entirety. These 
chemical linkers can be attached to purified proteins using numerous protocols known 

15 in the art, such as those described in Pierce Chemicals "Solutions, Cross-linking of 
Proteins: Basic Concepts and Strategies," Seminar #12, Rockford, IL. 

In another embodimen^ccording to the present invention, the linker can be a 
peptide having fi-om about 2 to aV)ut 60 amino acid residues, for example fi-om about 
5 to about 40, or fi-om about 10 to s^bout 30 amino acid residues, such as is knovm in 
20 single-chain antibody research. Examples of such known linker moieties include 
GGGGS (SEQ ID N0:2), (GGGGS)n (SEQ. ID NO:3), GKSSGSGSESKS (SEQ ID 
NO:4), GSTSGSGKSSEGKG (SEQ. IdW):5), GSTSGSGKSSEGSGSTKG (SEQ 
ID NO:6), GSTSGSGKSSEGKG (SEQ ro^O:7), GSTSGSGKPGSGEGSTKG 
(SEQ ID NO:8), EGKSSGSGSESKEF (SEQJD NO:9), SRSSG (SEQ. ID NO: 10), 
25 SGSSC (SEQ ID NO: 11), and the like. A Diphtheria toxin trypsin sensitive linker 
having the sequence AMGRSGGGCAGNRVG^SLSCGGLNLQAM (SEQ ID 
NO: 12) is also useful. Alternatively, the peptide linker moiety can be VM or AM, or 
have the structure described by the formula: AM(G\ to 4S)xAM wherein X is an integer 
from 1 to 1 1 (SEQ ID NO: 13). Additional linking moieties are described, for 
30 example, in Huston et al, P//^*? 85:5879-5883, 1988; Whitlow, M., et al. Protein 
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Engineering 6:989-995, 1993; Newton et al. Biochemistry 21' 5^5-553, 1996; A. J. 
Cumber et al, Bioconj. Chem, 1:394-401, 1992; Ladumer et al, J. Mol Biol 
221:330-337, 1997; and U.S. Patentee. 4;894,443, the latter of which is incorporated 
herein by reference in its entirety. 

5 Generally, however, the linker contains from about 5 to about 245 amino 

acids, although there is no theoretical upper limit on the number of amino acids that 
could be used in the linker. Preferably, the linker contains from about 53 to about 125 
amino acids. The amino acids in the linker protein are preferably selected to provide 
flexibility to the linker. Preferably, a multiplicity of flexibility enhancing amino 
10 acids, such as proline, glycine, alanine and serine, are incorporated into the linker to 
f. enhance its flexibility. 



Assxmiing a span of approximately 3.35 angstroms per amino acid within the 
flexible peptide bridge encoded by a 36-base pair Sfil compatible oligonucleotide, the 
predicted minimum and maximum distance for the lengths of the linker having from 0 

15 to 20 linker segments ranges from about 16.75 angstroms (the 5 amino acid bridge) to 
804 angstroms (20-linker segments + the 5 amino acid bridge). Thus, the length of 
the linker can readily be selected to enhance dimerization between any two particular 
members acting as dimer partners by including as many linker segments as is 
preferred to enhance the biological fimctions of the fimctional dimer, as discussed 

20 herein. 

In a presently preferred embodiment, the nucleotide encoding the polypeptide 
linker contains a restriction endonuclease recognition site which produces an 
overhang composed of non-palindromic center bases to allow for insertion of 
compatible inserts in a viniform orientation and in continually in-frame blocks along 

25 the length of the polypeptide linker. This type of linker allows incremental expansion 
of the linker peptide to produce chimeric proteins containing linkers with a range of 
distances between the protein xmits. The nucleotide encoding the linker preferably 
contains a rare 8-base-pair Sfil recognition site that is usefiil in making constructs 
with linkers of variable length. In addition, the nucleotides composing the recognition 

30 site: 
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-GGCCNNNNNGGCC- (SEQ ID NO: 14) 

are guanidines and cytosines, which can be oriented in frame to encode glycine and 
proline residues in accordance with the criteria of producing a "flexible" protein 
linker for junction of the two units of the chimeric protein. Any bases can be used as 
the "N" nucleotides contained within the recognition site, allowing further flexibility 
in the design of the linker. 

A presently preferred linker amino acid sequence is GPGGGSGGGSGT (SEQ 
ID NO: 15), which provides a high degree of predicted flexibility while minimizing 
repetitive sequence within the encoding oligonucleotide. 

In accordance with another embodiment of the present invention, there are 
provided nucleotides encoding invention chimeric protein(s) and cells containing such 
nucleotides. Cells containing invention polynucleotides can be either mammalian or 
non-mammalian, for example, plant or fungi cells, and the like. 

In accordance with another embodiment of the present invention, there are 
provided methods for modulating the expression of exogenous gene(s) in a subject 
organism containing: 

1) a functional entity according to the invention and 

2) a DNA construct encoding and expressing the exogenous 
gene(s) under the control of a response element responsive to the functional 
dimer. 

Invention methods for modulating exogenous gene expression in such a subject 
organism comprise administering to the subject organism an effective amount of at 
least one ligand for the functional dimer. 

Ligand is selected to activate at least one functional unit of the functional entity. 
When the functional entity is a functional dimer, the ligand is usually selected to activate 
the dominant member of the functional dimer. For example, if one of the two members 
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in the functional dimer is a silent dimer partner, the ligand is selected to activate the 
member in the functional dimer that does not act as a silent dimer partner. 

In a presently preferred embodiment of the present invention, one of the 
members in the functional entity is from an insect species, and the preferred ligand is an 
insect hormone. For example, preferred insect receptors are the Drosophila ecdysone 
receptor or the Bombyx ecdysone receptor, and the preferred dimer partner with these 
insect receptors in the invention chimeric protein is either the ultraspiracle protein or a 
retinoid X receptor. These functional entities complex with the ecdysone response 
element, generally in the presence of ligand for the functional dimer formed by the 
invention chimeric protein, but in some instances the presence of ligand is not required 
to form a functional entity/response element complex, as explained more fully 
hereinbelow. 

As employed herein, the terms "modulate" and "modulating" refer to the ability 
of a given functional entity to activate/deactivate and/or up-regulate/down-regulate 
transcription of exogenous nucleic acids, relative to the transactivation activity in the 
absence of the functional entity. 

The actual effect of an invention functional entity on the transcription of 
exogenous or endogenous nucleic acids will vary depending on the particular 
combination of dimerization domains and/or members of the steroid/thyroid hormone 
nuclear receptor superfamily in the chimeric protein, on the presence or absence of 
specific ligand for the ligand binding domain(s) employed in the chimeric protein, and 
on the regulatory element (e.g., response element) with which the selected chimeric 
protein interacts. It is specifically contemplated within the scope of the present invention 
that modulation includes repression of expression of one or more genes. Such repression 
can be either ligand-dependent repression or repression that occurs independently of the 
presence of a ligand. Thus, there are four types of modulation contemplated within the 
scope of the invention: ligand-dependent induced modulation, ligand-dependent 
repressed modulation, ligand-independent induced modulation, and ligand independent 
repressed modulation. The ligand can be either exogenous or endogenous to the subject 
treated for modulation of expression of an exogenous gene. 
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More particularly, the type of modulation that results from the practice of the 
invention method (i.e., whether activation or repression of expression of the exogenous 
gene) depends upon the combination of dimerization domains and/or members of the 
steroid/thyroid hormone nuclear receptor superfamily contained within a functional 
5 entity formed by the invention chimeric protein. For example, it has been determined 
that activation of expression of the exogenous gene(s) according to the invention 
modulation method can be achieved if the dimer partners in an invention FD used in the 
invention method of modulation are an ecdysone receptor and a retinoid X receptor, e.g. 
EOR or E5R. Ligands suitable for activating expression of the exogenous gene(s) when 
10 such functional dimers are employed in the invention methods include muristerone A, 
20-hydroxyecdysone, phytoecdysteroid(s), and the like. 

''f On the other hand, it has been determined that expression of an exogenous 

1^ gene(s) can be repressed independently of (i.e., with or without) the presence of ligand if 

Cl an invention FD comprising an ecdysone receptor (e.g., either a Drosophila ecdysone 

15 receptor or a Bombyx ecdysone receptor) and an ultraspiracle protein as dimer partner is 
\^ used in the invention method, e.g., E5U, and the like. Ligands suitable for activating 

jr: expression of the exogenous gene(s) when such functional dimers are employed in the 

!□ invention methods are 20-hydroxyecdysone, muristerone A, phytoecdysteroid(s), and the 

*^ like. In addition, expression of exogenous gene(s) can be repressed independently of 

20 (i.e., with or without) the presence of ligand if an invention FD comprising a Bombyx 
ecdysone receptor and an RXR as dimer partner is used in the invention method. As 
shown in Example 6 herein, an N-terminal His tag on the chimeric protein to aid in 
protein purification does not effect binding of the chimeric protein to a suitable response 
element so as to repress expression of the exogenous gene according to invention 
25 methods for modulating expression of exogenous gene(s)- 

Accordingly, in another embodiment of the present invention, there are provided 
methods for modulating (i.e., either activating or repressing) the expression of one or 
more genes in a subject organism independently of the presence of ligand for the 
invention chimeric protein. If the subject organism contains an invention chimeric 
30 protein, the invention method comprises introducing to the subject an exogenous 
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response element(s) with which the chimeric protein interacts and which controls 
expression of the one or more genes, thereby modulating expression of the gene(s) 
independent of the presence of ligand for the chimeric protein. On the other hand, if 
the subject organism contains an exogenous response element(s) controlling 
expression of the one or more genes, the invention method comprises introducing to 
the subject an invention chimeric protein with which the response element interacts, 
thereby modulating expression of the gene(s) independent of the presence of ligand 
for the chimeric protein. 

In accordance with another embodiment of the present invention, there are 
provided methods for modulating (i.e., either activating or repressing) the expression of 
one or more exogenous genes independent of ligand for the chimeric protein. If the 
subject contains a chimeric protein according to the invention,, the invention method 
comprises introducing to the subject an effective amount of a response element, wherein 
the response element is responsive to the chimeric protein and wherein the modulation is 
independent of ligand for the chimeric protein. The modulation can be ligand 
independent activation or ligand independent repression. 

In accordance with another embodiment of the present invention, there are 
provided methods for modulating (i.e., either activating or repressing) the expression of 
one or more exogenous genes in a cell containing: 

1) an invention chimeric protein and 

2) a DNA construct comprising the exogenous gene under the 
control of a response element with which the chimeric protein interacts, 
wherein said response element controls expression of the exogenous gene, 
said method comprising administering to the cell an effective amount of an 

exogenous ligand for at least one functional unit of the chimeric protein. 

In accordance with another embodiment of the present invention, there are 
provided methods for modulating the expression of one or more genes in a subject 
organism containing an endogenous response element controlling expression of one or 
more genes. The invention method in this situation comprises introducing to the 
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subject an invention chimeric protein, wherein the chimeric protein interacts with the 
response element, thereby modulating expression of the gene(s) dependent on the 
presence of endogenous ligand therefor. The chimeric protein is encoded by an 
inducible DNA construct and the modulating comprises inducing expression of the 
gene(s). 

In another embodiment according to the present invention, there are provided 
methods for modulating the expression of one or more genes in a subject organism 
containing an endogenous response element controlling expression of one or more 
genes and an endogenous ligand. The invention method comprises introducing to the 
subject an invention chimeric protein that interacts with the endogenous ligand and 
wherein the chimeric protein interacts with the response element, thereby modulating 
expression of the gene(s) dependent on the presence of the endogenous ligand. If the 
invention chimeric protein is encoded by an inducible DNA construct, the modulating 
further comprises inducing expression of the chimeric protein. This embodiment of 
the invention is especially useful for controlling expression of an exogenous gene that is 
under the control of an endogenous response element wherein the ligand for the 
invention functional dimer is also endogenous. 

Response elements contemplated for use in the practice of the present invention 
(relating to modulation of the expression of exogenous genes in a subject) include native, 
as well as modified response elements. For example, since invention functional dimers 
can function as either homodimers or as heterodimers (v^th a silent partner therefor), any 
response element that is responsive to an invention functional dimer, in the form of a 
homodimer or heterodimer, is contemplated for use in the invention methods described 
herein. As is readily recognized by those of skill in the art, invention functional dimers 
(whether in the form of a homodimer or a heterodimer) can bind to a response element 
having an inverted repeat motif (i.e., two or more half sites in mirror image orientation 
with respect to one another), to a response element having a direct repeat motif, and the 
like. 
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Response elements useful in conjunction with invention functional entities are 
those well known in the art. As readily recognized by those of skill in the art, the 
response element employed will vary as a function of the protein units incorporated into 
the functional entity. Thus, for example, retinoic acid receptor response elements are 
composed of at least one direct repeat of two or more defined half sites separated by a 
spacer of five nucleotides. The spacer nucleotides can independently be selected from 
any one of A, C, G or T. Each half site of response elements contemplated for use in 
the practice of the invention comprises the sequence: 

-RGBNNM-, 
wherein 

R is selected from A or G; 
B is selected from G, C, or T; 

each N is independently selected from A, T, C, or G; and 
M is selected from A or C; 

v^th the proviso that at least 4 nucleotides of said -RGBNNM- sequence are identical 
with the nucleotides at corresponding positions of the sequence -AGGTCA-. Response 
elements employed in the practice of the present invention can optionally be preceded 
by Nx, wherein x falls in the range of 0 up to 5. 

For example, thyroid hormone receptor response elements can be composed of 
the same half site repeats, with a spacer of four nucleotides. Alternatively, palindromic 
constructs as have been described in the art are also functional as TR response elements. 

Exemplary GAL4 response elements are those containing the palindromic 17- 

mer: 

5'-CGGAGGACTGTCCTCCG-3' (SEQ ID NO:16), 

such as, for example, 17MX, as described by Webster et al., in Cell 52:169-178 (1988), 
as well as derivatives thereof Additional examples of suitable response elements 
include those described by HoUenberg and Evans in Cell 55:899-906 (1988); or 
Webster et al. in Cell 54:199-207 (1988). 



• 
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Ecdysone response element sequences are preferred for use herein with 
functional dimers containing an ecdysone receptor function in a position- and 
orientation-independent fashion. The native ecdysone response element has been 
previously described, see, e.g., Yao et al., Ce//, 21:63-72, 1992. 

In the invention methods the operative response element is functionally linked 
to an operative exogenous gene(s) whose expression it is desirable to control. The word 
"operative" means that the respective DNA sequences (represented by the terms 
"response element" and "exogenous or endogenous gene") are operational, i.e., work 
for their intended purposes; the word "functionally" means that after the two segments 
are linked, upon appropriate activation by a functional dimer/ligand complex, the 
exogenous gene(s) will be expressed as the result of the fact that the "response element" 
was "tumed on" or otherwise activated. 

Certain nucleic acid constructs contemplated for use in one aspect of the present 
invention include promoters and regulatory elements operatively associated with 
exogenous nucleic acids. In one embodiment of the present invention, the invention 
functional dimer, in the presence of a ligand therefor, binds the regulatory element and 
activates transcription of one or more exogenous nucleic acids. For example, an 
invention functional dimer containing the protein units RXR and EcR will transactivate 
an ecdysone response element-containing promoter in the presence of the hormone 
ecdysone, or the synthetic analog, muristerone A. 

Regulatory elements contemplated for use in the practice of the present invention 
include elements responsive to the invention receptor peptide. In a preferred 
embodiment of the present invention, such elements are exogenous regulatory elements 
not normally present in the cells of the host. One class of exogenous regulatory elements 
25 contemplated for use herein includes hormone response elements that modulate 

transcription of exogenous nucleic acid when bound to the DNA binding domain of an 
invention receptor peptide. 




Regulatory elements employed in the practice of the present invention are 
operably linked to a suitable promoter for transcription of exogenous nucleic acid(s) 
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product(s). As used herein, the term "promoter" refers to a specific nucleotide sequence 
recognized by RNA polymerase, the enzyme that initiates RNA synthesis. The promoter 
sequence is the site at which transcription can be specifically initiated under proper 
conditions. When exogenous nucleic acid(s), operatively linked to a suitable promoter, 
5 is(are) introduced into the cells of a suitable host, expression of the exogenous nucleic 
acid(s) is(are) controlled in many, but not all cases, by the presence of ligands, which are 
not normally present in the host cells. 

Promoters contemplated for control of expression of exogenous nucleic acids 
employed in the practice of the present iavention include inducible (e.g., minimal CMV 
10 promoter, minimal TK promoter, modified MMLV LTR), constitutive (e.g., chicken (}- 
actin promoter, MMLV LTR (non-modified), DHFR), and/or tissue specific promoters. 

Inducible promoters contemplated for use in the practice of the present invention 
comprise transcription regulatory regions that fimction maximally to promote 
transcription of mRNA under inducing conditions. Examples of suitable inducible 

15 promoters include DNA sequences corresponding to: the E. coli lac operator responsive 
to IPTG (see Nakamura et al. Cell, 18:1 109-1 117, 1979); the metallothionein promoter 
metal-regulatory-elements responsive to heavy-metal (e.g., zinc) induction (see Evans et 
al, U.S. Patent No. 4,870,009), the phage T71ac promoter responsive to IPTG (see 
Studier et al, Meth Enzymol, 185: 60-89, 1990; and U.S. Patent No. 4,952,496), the 

20 heat-shock promoter; the TK minimal promoter; the CMV minimal promoter; a 
synthetic promoter; and the like. 

Exemplary constitutive promoters contemplated for use in the practice of the 
present invention include the CMV promoter, the S V40 promoter, the DHFR promoter, 
the mouse mammary tumor virus (MMTV) steroid-inducible promoter, Moloney murine 

25 leukemia virus (MMLV) promoter, elongation factor la (EFla) promoter, albumin 
promoter, APO Al promoter, cyclic AMP dependent kinase II (CaMKII) promoter, 
keratin promoter, CD3 promoter, immunoglobulin light or heavy chain promoters, 
neurofiliment promoter, neuron specific enolase promoter, L7 promoter, CD2 promoter, 
myosin light chain kinase promoter, HOX gene promoter, thymidine kinase (TK) 

30 promoter, RNA Pol II promoter, MYOD promoter, MYF5 promoter. 
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phosphoglycerokinase (PGK) promoter, Stfl promoter. Low Density Lipoprotein (LDL) 
promoter, chicken b-actin promoter (used in conjunction with ecdysone response 
element), and the like. 

As readily understood by those of skill in the art, the term "tissue specific" refers 
to the substantially exclusive initiation of transcription in the tissue from which a 
particular promoter that drives expression of a given gene is derived (e.g., expressed 
only in T-cells, endothelial cells, smooth muscle cells, and the like). Exemplary tissue 
specific promoters contemplated for use in the practice of the present invention include 
the GH promoter, the NSE promoter, the GFAP promoter, neurotransmitter promoters 
(e.g., tyrosine hydroxylase, TH, choline acetyltransferase, ChAT, and the like), 
promoters for neurotropic factors (e.g., a nerve growth factor promoter, NT-3, BDNF 
promoters, and the like), and so on. 

As used herein, when referring to nucleic acids, the phrase "exogenous to said 
mammalian host" or simply "exogenous" refers to nucleic acids not naturally found at 
levels sufficient to provide a fimction in the particular cell where transcription is desired. 
For example, exogenous nucleic acids can be either natural or synthetic nucleic acids, 
which are mtroduced into the host in the form of DNA or RNA. The nucleic acids of 
interest can be introduced into target cells (for in vitro appUcations), or the nucleic acids 
of interest can be introduced directly or indirectly into a host, for example, by the 
transfer of transformed cells into a host. 

In contrast to exogenous nucleic acids, the phrase "endogenous nucleic acids" or 
"endogenous genes" refers to nucleic acids naturally found at levels sufficient to provide 
a function in the particular cell where transcription is desired. 

Exogenous nucleic acids contemplated for use in the practice of the present 
25 invention include wild type and/or therapeutic nucleic acids. "Wild type" genes are 

those that are native to cells of a particular type. Exemplary wild type nucleic acids are 
genes which encode products the substantial absence of which leads to the occurrence of 
a non-normal state in a host; or a substantial excess of which leads to the occurrence of a 
non-normal state in a host. 
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Such genes may not be expressed in biologically significant levels or may be 
undesirably overexpressed. Thus, for example, while a synthetic or natural gene coding 
for human insulin would be exogenous genetic material to a yeast cell (since yeast cells 
do not naturally contain insulin genes), a human insulin gene inserted into a human skin 
5 fibroblast cell would be a wild type gene with respect to the fibroblast since human skin 
fibroblasts contain genetic material encoding human insulin, although human skin 
fibroblasts do not express human insulin in biologically significant levels. 

Therapeutic nucleic acids contemplated for use in the practice of the present 
invention include those which encode products which are toxic to the cells in which they 
are expressed; or encode products which impart a beneficial property to a host; or those 
which transcribe nucleic acids which modulate transcription and/or translation of 
endogenous genes. 

As employed herein, the phrase "therapeutic nucleic acids" refers to nucleic 
acids that impart a beneficial function to the host in which such nucleic acids are 
transcribed. Therapeutic nucleic acids are those that are not naturally found in host cells. 
For example, synthetic or natural nucleic acids coding for wild type human insulin 
would be therapeutic when inserted into a skin fibroblast cell so as to be expressed in a 
human host, where the human host is not otherwise capable of expressing fiinctionally 
active human insulin in biologically significant levels. Further examples of therapeutic 
nucleic acids include nucleic acids that transcribe antisense constructs used to suppress 
the expression of endogenous genes. Such antisense transcripts bind endogenous nucleic 
acid (mRNA or DNA) and effectively cancel out the expression of the gene. In 
accordance with the methods described herein, therapeutic nucleic acids are expressed at 
a level that provides a therapeutically effective amount of the corresponding therapeutic 
protein. 

Exogenous nucleic acids useful in the practice of the present invention include 
genes that encode biologically active proteins of interest, such as, e.g., secretory proteins 
that can be released from said cell; enzymes that can metabolize a toxic substance to 
produce a non-toxic substance, or that metabolize an inactive substance to produce a 
30 usefiil substance; regulatory proteins; cell surface receptors; and the like. Useful genes 
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include genes that encode blood clotting factors, such as human factors VIII and IX; 
genes that encode hormones, such as insulin, parathyroid hormone, luteinizing hormone 
releasing factor (LHRH), alpha and beta seminal inhibins, and human growth hormone; 
genes that encode proteins, such as enzymes, the absence of which leads to the 
occurrence of an abnormal state; genes encoding cytokines or lymphokines such as 
interferons, granulocytic macrophage colony stimulating factor (GM-CSF), colony 
stimulating factor-1 (CSF-1), tumor necrosis factor (TNF), and erythropoietin (EPO); 
genes encoding inhibitor substances such as alphai -antitrypsin; genes encoding 
substances that function as drugs, e.g., genes encoding the diphtheria and cholera toxins; 
and the like. 

Additional nucleic acids contemplated for use in accordance with the present 
invention include genes that encode proteins present in dopaminergic neurons (useful, 
for example, for the treatment of Parkinson's disease), cholinergic neurons (useful, for 
example, for the treatment of Alzheimer's disease), hippocampal pyramidal neurons 
(also ixseful for the treatment of Alzheimer's disease), norepinephrine neurons (useful, 
for example, for the treatment of epilepsy), spinal neurons (useful, for example, for the 
treatment of spinal injury), glutamatergic neurons (useful, for example, for the treatment 
of schizophrenia), cortical neurons (useful, for example, for the treatment of stroke and 
brain injury), motor and sensory neurons (useful, for example, for the treatment of 
amyotrophic lateral sclerosis), and the like. 

Typically, nucleic acid sequence information for proteins encoded by exogenous 
nucleic acid(s) contemplated for use employed herein can be located in one of many 
public access databases, e.g., GENBANK, EMBL, Swiss-Prot, and PIR, or in related 
journal publications. Thus, those of skill in the art have access to sequence information 
for virtually all known genes. Those of skill in the art can obtain the corresponding 
nucleic acid molecule directly from a public depository or from the institution that 
published the sequence. Optionally, once the nucleic acid sequence encoding a desired 
protein has been ascertained, the skilled artisan can employ routine methods, e.g., 
polymerase chain reaction (PGR) amplification, to isolate the desired nucleic acid 
molecule from the appropriate nucleic acid library. Thus, all known nucleic acids 
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encoding proteins of interest are available for use in the methods and products described 
herein. 

Additional components that can optionally be incorporated into the invention 
constructs include selectable markers and genes encoding proteins required for retroviral 
5 packaging, e.g., the pol gene, the gag gene, the env gene, and the like. 

Selectable markers contemplated for use in the practice of the present invention 
include antibiotic resistance genes, genes that enable cells to process metabolic 
intermediaries, and the like. Exemplary antibiotic resistance genes include genes which 
impart tetracycline resistance, genes that impart ampicillin resistance, neomycin 
1 0 resistance, hygromycin resistance, puromycin resistance, and the like. 

Genes that enable cells to process metabolic intermediaries include genes which 
permit cells to incorporate L-histidinol, genes encoding thymidine kinase, genes 
encoding xanthine-guanine phosphoribosyl transferase (gpt), genes encoding 
dihydrofolate reductase, genes encoding asparagine synthetase, and the like. 

15 As employed herein, the terms "subject organism" and "host" refer to tiie cell, 

tissue, organ or organism in need of transcriptional regulation of exogenous or 
endogenous nucleic acids. The subject organism can be mammaUan or mammalian- 
derived cells or tissue. Exemplary mammals include: humans; domesticated animals, 
e.g., rat, mouse, rabbit, canine, feline, and the like; farm animals, e.g., chicken, bovine, 

20 ovine, porcine, and the like; animals of zoological interest, e.g., monkey, baboon, and the 
like, or a cell thereof Alternatively, a subject organism can be a non-mammalian, 
preferably non-insect, such as a plant, fungus or other non-mammalian species, or a 
cell of such a non-mammalian species. 

As employed herein, the term "Ugand" (or ligand precursor) refers to a steroidal 
25 or non-steroidal substance or compound which, in its native state (or after conversion to 
its "active" form), binds to at least one of the protein units, or to the invention chimeric 
protein, thereby creating a ligand/functional entity complex, which in turn can bind an 
appropriate response element and activate transcription therefrom. Ligands function to 
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modulate transcription of nucleic acid(s) maintained under the control of a response 
element. Such ligands are well known in the art. 

In accordance with one aspect of the present invention, unless and until a suitable 
ligand is administered to the host, substantially no transcription of the desired exogenous 
nucleic acids occurs. Since ecdysteroids, for example, are not naturally present in 
mammalian, plant and fungal systems, and the like, if it is desired that transcription of a 
particular exogenous nucleic acid be vinder precise control of the practitioner, a chimeric 
protein containing an ecdysone receptor as one of the protein units and a suitable dimer 
partner therefore is used and the exogenous nucleic acid is put xmder the control of an 
ecdysone response element, i.e. a response element to which an activated ecdysone 
receptor binds in nature. 

The terms "ecdysone" and "ecdysteroid" as interchangeably used herein, are 
employed in the generic sense (in accordance with common usage in the art), referring to 
a family of ligands with the appropriate binding and transactivation activity (see, for 
example, Cherbas et al., in Biosynthesis^ metabolism and mode of action of invertebrate 
hormones (Ed. J. Hof&nann and M. Porchet), Springer- Verlag, Berlin, p 305-322. An 
ecdysone, therefore, is a compound which acts to modulate gene transcription for a gene 
maintained under the control of an ecdysone response element. 

20-Hydroxy-ecdysone (also known as p-ecdysone) is the major naturally 
occurring ecdysone. Unsubstituted ecdysone (also known as a-ecdysone) is converted 
in peripheral tissues to P-ecdysone. Analogs of the naturally occurring ecdysones are 
also contemplated v^thin the scope of the present invention. Examples of such analogs, 
commonly referred to as ecdysteroids, include ponasterone A, 26 iodoponasterone A, 
muristerone A, inokosterone, 26-mesylinokosterone, and the like. Since it has been 
previously reported that the above-described ecdysones are neither toxic, teratogenic, nor 
known to affect mammalian physiology, they are ideal candidates for use as inducers in 
cultured cells and transgenic mammals according to the invention methods. 

Other phytoecdysteroids are also contemplated for use in the practice of the 
invention as ligands of receptors which recognizes ecdysone response elements. Such 
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phytoecdysteroids are known in the art (J.H. Adler et al.. Lipids 30(3):257-62, 1995). 
The biological effect of phytoecdysteroids in higher animals are also known (V.N. 
Syrov, Eksp. Klin Farmakol 57(5):61-6, 1994). 

Non-steroidal ligands are also contemplated for use in the practice of the present 
invention as ligands of ecdysone response elements. For example, when a ligand not 
normally present in the cells of the host to be treated is desired (i.e., a ligand exogenous 
to the host), a hydrazine can be employed as the ligand, preferably a diacyl hydrazine. 
Such hydrazines include compounds that are readily available and/or are relatively 
inexpensive to manufacture. One such compound, tebufenozide, is a non-steroidal 
ecdysone agonist which is commercially available. This compound specifically targets 
lepidopteran species, including Bombyx mori. Tebufenozide has undergone extensive 
testing in animal hosts and has proved to be of very low toxicity to mammals and other 
non-insect species. 

Additional exemplary hydrazines contemplated for use herein include 1,2-diacyl 
hydrazines (e.g., tebufenozide), N'-substituted-N,N'-disubstituted hydrazines, 
dibenzoylalkyl cyanohydrazines, N-substituted-N-alkyl-N,N-diaroyl hydrazines, N- 
substituted-N-acyl-N-alkyls, carbonyl hydrazines, N-aroyl-N'-alkyl-N'-aroyl hydrazines, 
and the like. Since it has been previously reported that the above-described diacyl 
hydrazines are neither toxic, teratogenic, nor known to affect mammalian physiology, 
they are ideal candidates for use as exogenic ligands (e.g. as inducers) in cultured cells 
and transgenic mammals according to invention methods. 

Ligands, and formulations containing them, administered in a manner compatible 
with the route of administration, the dosage formulation, and in a therapeutically 
effective amount. The required dosage will vary with the particular treatment desired, 
25 the degree and duration of therapeutic effect desired, the judgment of the practitioner, as 
well as properties peculiar to each individual. Moreover, suitable dosage ranges for 
systemic application depend on the route of administration. It is anticipated that dosages 
between about 10 micrograms and about 1 milligram per kilogram of body weight per 
day will be used for therapeutic treatment. 
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An effective amount of ligand contemplated for vise in the practice of the present 
invention is the amount of ligand (e.g., diacyl hydrazine(s)) required to achieve the 
desired level of transcription and/or translation of exogenous nucleic acid. A 
therapeutically effective amount is typically an amount of ligand or ligand precursor that, 
when administered in a physiologically acceptable composition, is sufficient to achieve a 
plasma concentration of the transcribed or expressed nucleic acid product from about 

0. 1 mg/ml to about 100 mg/ml, for example, from about 1.0 mg/ml to about 50 mg/ml, 
and preferably at least about 2 mg/ml and usually 5 to 10 mg/ml. 

Ligand can be administered in a variety of ways, as are well-knovm in the art, 

1. e., by any means that produces contact between ligand and receptor peptide. For 
example, such ligands can be administered topically, orally, intravenously, 
intraperitoneally, intravascularly, and the like. The administration can be by any 
conventional means available for use in conjunction with pharmaceuticals, e.g., by 
intravenous injection, either as individual therapeutically active ingredients or in a 
combination with other therapeutically active ingredients. Ligands contemplated for use 
in the practice of the present invention can be administered alone, but are generally 
administered with a pharmaceutical carrier selected on the basis of the chosen route of 
administration and standard pharmaceutical practice. 

In accordance with a particular embodiment of the present invention, 
pharmaceutically acceptable formulations, and kits thereof, comprising at least one 
ligand for an invention fimctional dimer, for example an ecdysteroid, and a 
pharmaceutically acceptable carrier are contemplated. In accordance with another aspect 
of the present invention, pharmaceutically acceptable formulations consisting essentially 
of at least one ligand and a pharmaceutically acceptable carrier, are contemplated. 
Pharmaceutical formulations of the present invention can be used in the form of a solid, 
a solution, an emulsion, a dispersion, a micelle, a liposome, and the like, wherein the 
resulting formulation contains one or more of the ligands of the present invention, as an 
active ingredient, in admixture with an organic or inorganic carrier or excipient suitable 
for enteral or parenteral applications. 
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The ligand(s) may be compounded, for example, with the usual non-toxic, 
phamiaceutically acceptable carriers suitable for administration by oral, topical, nasal, 
transdermal, intravenous, subcutaneous, intramuscular, intracutaneous, intraperitoneal, 
intravascular, and the like means. Administration in the form of creams, lotions, tablets, 
5 dispersible powders, granules, symps, elixirs, sterile aqueous or non-aqueous solutions, 
suspensions or emulsions, and the like, is contemplated. Exemplary pharmaceutically 
acceptable carriers include carriers for tablets, pellets, capsules, suppositories, solutions, 
emulsions, suspensions, and any other form suitable for use. Such carriers which can be 
used include glucose, lactose, gum acacia, gelatin, mannitol, starch paste, magnesium 

10 trisilicate, talc, com starch, keratin, colloidal silica, potato starch, urea, medium chain 
length triglycerides, dextrans, and other carriers suitable for use in manufacturing 
preparations, in solid, semisolid, or liquid form. In addition auxiliary, stabilizing, 
thickening and coloring agents and/or perfumes may be used. The active compound 
(e.g., ecdysteroid as described herein) is included in the pharmaceutically acceptable 

15 formulation in an amount sufficient to produce the desired effect upon the process or 
condition of diseases. 

Pharmaceutically acceptable formulations containing ligand(s) as active 
ingredient may be in a form suitable for oral use, for example, as aqueous or oily 
suspensions, syrups or elixirs, tablets, troches, lozenges, dispersible powders or granules, 

20 emulsions, or hard or soft capsules. For the preparation of oral liquids, suitable carriers 
include emulsions, solutions, suspensions, symps, and the like, optionally containing 
additives such as wetting agents, emulsifying and suspending agents, dispersing agents, 
sweetening, flavoring, coloring, preserving and perfuming agents, and the like. 
Formulations intended for oral use may be prepared according to any method known to 

25 the art for the manufacture of pharmaceutically acceptable formulations. 

Tablets containing ligand(s) as active ingredient in admixture with non-toxic 
pharmaceutically acceptable excipients may also be manufactured by known methods. 
The excipients used may be, for example, (1) inert diluents such as calcium carbonate, 
lactose, calcium phosphate or sodium phosphate; (2) granulating and disintegrating 
30 agents such as com starch, potato starch or alginic acid; (3) binding agents such as gum 
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tragacanth, com starch, gelatin or acacia, and (4) lubricating agents such as magnesium 
stearate, stearic acid or talc. The tablets may be uncoated or they may be coated by 
known techniques to delay disintegration and absorption in the gastrointestinal tract and 
thereby provide a sustained action over a longer period. For example, a time delay 
5 material such as glyceryl monostearate or glyceryl distearate may be employed. They 
may also be coated by the techniques described in the U.S. Pat. Nos. 4,256,108; 
4,160,452; and 4,265,874, to form osmotic therapeutic tablets for controlled release. 

In some cases, formulations for oral use may be in the form of hard gelatin 
capsules wherein the ligand is mixed with an inert solid diluent, for example, calcium 
carbonate, calcium phosphate or kaolin. They may also be in the form of soft gelatin 
capsules wherein the ligand is mixed with water or an oil medium, for example, peanut 
oil, liquid paraffin, or olive oil. 

The pharmaceutically acceptable formulations may be in the form of a sterile 
injectable suspension. Suitable carriers include non-toxic parenterally-acceptable sterile 
aqueous or non-aqueous solutions, suspensions, or emulsions. This suspension may be 
formulated according to known methods using suitable dispersing or wetting agents and 
suspending agents. They can also be manufactured in the form of sterile water, or some 
other sterile injectable medium immediately before use. Sterile, fixed oils are 
conventionally employed as a solvent or suspending medium. For this purpose any 
bland fixed oil may be employed including synthetic mono- or diglycerides, fatty acids 
(including oleic acid), naturally occurring vegetable oils like sesame oil, coconut oil, 
peanut oil, cottonseed oil, etc., or synthetic fatty vehicles like ethyl oleate or the like. 
They may be sterilized, for example, by filtration through a bacteria-retaining filter, by 
incorporating sterilizing agents into the formulations, by irradiating the formulations, or 
by heating the formulations. Sterile injectable suspensions may also contain adjuvants 
such as preserving, wetting, emulsifying, and dispersing agents. Buffers, preservatives, 
antioxidants, and the like can be incorporated as required. 

Compounds contemplated for use in the practice of the present invention may 
also be administered in the form of suppositories for rectal administration of the drug. 
30 These formulations may be prepared by mixing the drug with a suitable non-irritating 
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excipient, such as cocoa butter, synthetic glyceride esters of polyethylene glycols, which 
are solid at ordinary temperatures, but liquefy and/or dissolve in the rectal cavity to 
release the drug. 

Pharmaceutically acceptable formulations containing suitable ligand(s) are 
preferably administered intravenously, as by injection of a unit dose, for example. The 
term "unit dose," when used in reference to a pharmaceutically acceptable formulation 
of the present invention, refers to a quantity of the pharmaceutical formulation suitable 
as unitary dosage for the subject, each unit containing a predetermined quantity of active 
material calculated to produce the desired therapeutic effect in association with the 
required diluent, i.e., carrier, or vehicle. It may be particularly advantageous to 
administer such formulations in depot or long-lasting form as discussed hereinafter. 

Therapeutic compositions or pharmaceutically acceptable formulations 
containing suitable ligand are preferably administered intravenously, as by injection of a 
xmit dose, for example. The term "unit dose," when used in reference to a therapeutic 
composition of the present invention, refers to a quantity of ligand suitable as xinitary 
dosage for the subject, each unit containing a predetermined quantity of active material 
calculated to produce the desired therapeutic effect in association with the required 
diluent, i.e., carrier, or vehicle. It may be particularly advantageous to administer such 
compounds in depot or long-lasting form. 

20 Suitable regimes for initial administration and booster shots are variable, but are 

typified by an initial administration followed by repeated doses at one or more intervals, 
by a subsequent injection, or other administration. Alternatively, continuous intravenous 
infusion sufficient to maintain concentrations in the blood in the ranges specified for 
in vivo therapies are contemplated. 

25 In accordance with another embodiment of the present invention, there are 

provided methods for producing transgenic animals capable of prolonged and regulated 
expression of exogenous nucleic acid(s), said method comprising introducing into early- 
stage embryos or stem cells of the animal: 
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(i) a nucleic acid construct comprising a promoter and said exogenous 
nucleic acid(s) under the control of a regulatory element; and 

(ii) nucleic acid encoding an invention chimeric protein wherein the chimeric 
protein activates the regulatory element in the presence of a ligand for the 
functional dimer or represses the regulatory element independently of the 
presence of said ligand. 

As used herein, the phrase "transgenic animal" refers to an animal that contains 
one or more expression constructs containing one or more exogenous nucleic acid(s) 
under the transcription control of an operator and/or hormone response element as 
described herein. 

Methods of making transgenic animals using a particular nucleic acid construct 
are well-known in the art. When preparing invention transgenic animals, it is presently 
preferred that two transgenic lines are generated. The first line will express, for 
example, a chimeric protein as described above (e.g., VBEcR). Tissue specificity is 
conferred by the selection of a tissue-specific promoter (e.g., T-cell specific) that vnll 
direct expression of the chimeric protein to appropriate tissue. A second line contains a 
nucleic acid construct comprising a promoter and exogenous nucleic acid under the 
control of a response element, for example, an endogenous response element. Cross- 
breeding of these two lines will provide a transgenic animal that expresses an invention 
chimeric protein and the exogenous nucleic acid. 

In a presently preferred embodiment, an invention transgenic animal contains 
one or more expression constructs containing nucleic acid encoding an invention 
chimeric protein and exogenous nucleic acid under the transcription control of a 
response element. Thus, with tissue specific expression of the chimeric protein as 
described above and timely hgand treatment, gene expression can be induced or 
repressed with spatial, dosage, and/or temporal specificity. 

In accordance with yet another embodiment of the present invention, there are 
provided methods for modulating the transcription of an exogenous nucleic acid in a host 
containing: 
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(i) a nucleic acid construct comprising a promoter and said exogenous 
nucleic acid(s) under the control of a response element; and 

(ii) nucleic acid under the control of an inducible promoter, said nucleic acid 
encoding an invention chimeric protein wherein the functional entity formed by 
the invention chimeric protein activates or represses the response element in the 
presence of a ligand for the entity; and 

said method comprising introducing a ligand not normally present in the 
cells of the host and subjecting the host to conditions suitable to induce or repress 
expression of the invention chimeric protein. 

In accordance with yet another embodiment of the present invention, there are 
provided methods for the expression of recombinant products detrimental to a subject 
organism, said method comprising: 

transforming suitable cells in the organism with: 

(i) a nucleic acid construct comprising a promoter and exogenous nucleic 
acid(s) which express the recombinant product under the control of a regulatory 
element that is not normally present in the cells of said organism, and 

(ii) nucleic acid encoding an invention chimeric protein 

wherein the functional entity formed by the invention chimeric protein 
activates the regulatory element in the presence of a ligand for the functional 
entity; 

growing said cells to the desired level in the substantial absence of the 
ligand; and 

inducing expression of said recombinant product by administering to the 
organism a ligand, which, in combination with said entity, binds to said 
regulatory element and activates transcription therefrom. 

Recombinant products detrimental to a host organism contemplated for 
expression in accordance with the present invention include any gene product that 
functions to confer a toxic effect on the organism. For example, inducible expression of 
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a toxin such as the diphtheria toxin would allow for specific ablation of tissue (Ross et 
al Genes and Development 7:1318-1324 (1993)), for example to create a new phenotype 
in the transgenic animal. Moreover, the numerous gene products that are known to 
induce apoptosis in cells expressing such products are contemplated for use herein (see, 
5 e.g, Apoptosis, The Molecular Basis of Cell Death, Current Communications In Cell & 
Molecular Biology, Cold Spring Harbor Laboratory Press, 1991). 

In accordance with still another embodiment of the present invention, there are 
provided methods for modulating the transcription of nucleic acid(s) in an in vitro 
cellular system, wherein the method comprises administering to the cellular system an 
amount of ligand effective to modulate the transcription of the nucleic acid(s); wherein 
the ligand is not normally present in the cellular system and wherein the system 
comprises: 

(i) a nucleic acid construct comprising a promoter and the nucleic acid(s) 
under the control of a response element; and 

(ii) nucleic acid encoding an invention chimeric protein, 
wherein the functional entity formed by the invention chimeric protein 

activates or represses the regulatory element in the presence of a ligand for the 
ligand binding domain. 

In accordance wdth yet another embodiment of the present invention, there are 
20 provided methods for the treatment of a host in need of gene therapy, said method 
comprising: 

introducing into cells of said host: 

(i) a nucleic acid construct comprising a promoter and the exogenous 
nucleic acid(s) under the control of a response element; 

25 (ii) nucleic acid under the control of an inducible promoter, said nucleic acid 

encoding an invention chimeric protein, 
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wherein the functional dimer formed by the invention chimeric protein 
activates or represses the regulatory element in the presence of a ligand for the 
functional dimer, and 

administering, to said host, an effective amount of ligand for the 
invention functional dimer. 

Optionally, the cells can be obtained from the host, modified as above, and then 
reintroduced into the host organism. For example, the exogenous nucleic acid can be 
introduced directly into cells obtained from a donor (host or separate donor) and the 
modified cells are then implanted within the host organism. In a presently preferred 
embodiment, the transplanted cells are autologous with respect to the host. 
"Autologous" means that the donor and recipient of the cells are one and the same. 

Cells can be modified by "in vivo delivery" of biological materials by such routes 
of administration as oral, intravenous, subcutaneous, intraperitoneal, intrathecal, 
intramuscular, intracranial, inhalational, topical, transdermal, suppository (rectal), 
pessary (vaginal), and the like. The exogenous nucleic acid may be stably incorporated 
into cells or may be transiently expressed using methods known in the art. 

Modified cells are cultivated under growth conditions (as opposed to protein 
expression conditions) until a desired density is achieved. Stably transfected mammalian 
cells may be prepared by transfecting cells with an expression vector having a selectable 
marker gene (such as, for example, the gene for thymidine kinase, dihydrofolate 
reductase, neomycin resistance, and the like), and growing the transfected cells under 
conditions selective for cells expressing the marker gene. To prepare transient 
transfectants, mammalian cells are transfected with a reporter gene (such as the E. coli B- 
galactosidase gene) to monitor transfection efficiency. Selectable marker genes are 
typically not included in the transient transfections because the transfectants are typically 
not grown under selective conditions, and are usually analyzed within a few days after 
transfection. 

The concept of gene replacement therapy for humans involves the introduction 
of functionally active "wild type" or "therapeutic" nucleic acids into the somatic cells of 



an affected host to correct a gene defect or deficiency. However, in order for gene 
replacement therapy to be effective, it must be possible to control the time and location 
at which gene expression occurs. 

Genes that encode useful "gene therapy" proteins that are not normally 
5 transported outside the cell can be used in the invention if such genes are "functionally 
appended" to, or operatively associated with, a signal sequence that can "transport" the 
encoded product across the cell membrane. A variety of such signal sequences are 
known and can be used by those skilled in the art without imdue experimentation. 

Gene transfer vectors (also referred to as "expression vectors") contemplated for 
use herein are recombinant nucleic acid molecules that are used to transport nucleic acid 
into host cells for expression and/or replication thereof. Expression vectors may be 
either circular or linear, and are capable of incorporating a variety of nucleic acid 
constructs therein. Expression vectors typically come in the form of a plasmid that, upon 
introduction into an appropriate host cell, results in expression of the inserted nucleic 
acid. 

Suitable expression vectors for use herein are well known to those of skill in the 
art and include recombinant DNA or RNA construct(s), such as plasmids, phage, 
recombinant virus or other vectors that, upon introduction into an appropriate host cell, 
result(s) in expression of the inserted DNA. Appropriate expression vectors are well 
known to those of skill in the art and include those that are replicable in eukaryotic cells 
and/or prokaryotic cells and those that remain episomal or those which integrate into the 
host cell genome. Expression vectors typically further contain other functionally 
important nucleic acid sequences encoding antibiotic resistance proteins, and the like. 

The amount of exogenous nucleic acid introduced into a host organism, cell or 
25 cellular system can be varied by those of skill in the art. For example, when a viral 

vector is employed to achieve gene transfer, the amount of nucleic acid introduced can 
be varied by varying the amount of plaque forming units (PFU) of the viral vector. 

As used herein, the phrase "transcription regulatory region" refers to that portion 
of a nucleic acid or gene construct that controls the initiation of mRNA transcription. 
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Regulatory regions contemplated for use herein, in the absence of the non-mammalian 
transactivator, typically comprise at least a minimal promoter in combination with a 
regulatory element responsive to the ligand/receptor peptide complex. A minimal 
promoter, when combined with a regulatory element, functions to initiate mRNA 
5 transcription in response to a ligand/functional dimer complex. However, transcription 
will not occur unless the required inducer (ligand therefor) is present. However, as 
described herein certain of the invention chimeric protein heterodimers activate or 
repress mRNA transcription even in the absence of ligand for the DNA binding domain. 

As used herein, the phrase "operatively associated with" refers to the functional 
relationship of DNA wdth regulatory and effector sequences of nucleotides, such as 
promoters, enhancers, transcriptional and translational stop sites, and other signal 
sequences. For example, operative linkage of DNA to a promoter refers to the physical 
and functional relationship between the DNA and promoter such that the transcription of 
such DNA is initiated from the promoter by an RNA polymerase that specifically 
recognizes, binds to and transcribes the DNA. 

Preferably, the transcription regulatory region further comprises a binding site 
for ubiquitous transcription factor(s). Such binding sites are preferably positioned 
between the promoter and the regulatory element. Suitable ubiquitous transcription 
factors for use herein are well-known in the art and include, for example, Spl . 

20 Exemplary eukaryotic expression vectors include eukaryotic constructs, such as 

the pSV-2 gpt system (Mulligan etal, (1979) Nature, 277:108-1 14); pBlueSkript 
(Stratagene, La JoUa, CA), the expression cloning vector described by Genetics Institute 
{Science^ (1985) 228:810-815), and the like. Each of these plasmid vectors is capable of 
promoting expression of the chimeric protein of interest. 

25 Suitable means for introducing (transducing) expression vectors containing 

invention nucleic acid constructs into host cells to produce transduced recombinant cells 
(i.e., cells containing recombinant heterologous nucleic acid) are well-known in the art 
(see, for review, Friedmann, Science, 244:1275-1281, 1989; Mulligan, Science, 260:926- 
932. 1993, each of which are incorporated herein by reference in their entirety). 
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Exemplary methods of transduction include, e.g., infection employing viral vectors (see, 
e.g., U.S. Patent 4,405,712 and 4,650,764), calcium phosphate transfection (U.S. Patents 
4,399,216 and 4,634,665), dextran sulfate transfection, electroporation, lipofection (see, 
e.g., U.S. Patents 4,394,448 and 4,619,794), cytofection, particle bead bombardment, 
5 and the like. The transduced nucleic acid can optionally include sequences which allow 
for its extrachromosomal (i.e., episomal) maintenance, or the transduced nucleic acid can 
be donor nucleic acid that integrates into the genome of the host. 

In a specific embodiment, a gene transfer vector contemplated for use herein is a 
viral vector, such as Adenovirus, adeno-associated vims, a herpes-simplex virus based 
vector, a synthetic vector for gene therapy, and the like (see, e.g., Suhr et al. Arch of 
Neurol 50:1252-1268, 1993). Preferably, a gene transfer vector employed herein is a 
retroviral vector. Retroviral vectors contemplated for use herein are gene transfer 
plasmids that have an expression construct containing an exogenous nucleic acid 
residing between two retroviral LTRs. Retroviral vectors typically contain appropriate 
packaging signals that enable the retroviral vector, or RNA transcribed using the 
retroviral vector as a template, to be packaged into a viral virion in an appropriate 
packaging cell line (see, e.g., U.S. Patent 4,650,764). 

Suitable retroviral vectors for use herein are described, for example, in U.S. 
Patents 5,399,346 and 5,252,479; and in WIPO publications WO 92/07573, WO 
20 90/06997, WO 89/05345, WO 92/05266 and WO 92/14829, each of which is hereby 

incorporated herein by reference, in its entirety. These documents provide a description 
of methods for efficiently introducing nucleic acids into human cells using such 
retroviral vectors. Other retroviral vectors include, for example, mouse mammary tumor 
virus vectors (e.g., Shackleford et al, (1988) PNAS, USA, 85:9655-9659), human 
25 immunodeficiency virus (e.g., Naldini et al (1996) Science 272: 165-320), and the like. 

Various procedures are also well-known in the art for providing helper cells 
which produce retroviral vector particles that are essentially firee of replicating vims. 
See, for example, U.S. Patent 4,650,764; Miller, Human Gene Therapy, 1:5-14, 1990; 
Markowitz, et al. Journal of Virology, M{4):1 120-1 124, 1988; Watanabe, etal, 
30 Molecular and Cellular Biology, 2a2):224 1-2249, 1983; Danos, et al, PNAS, M:6460- 
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6464, 1988; and Bosselman, et al. Molecular and Cellular Biology, 2(5): 1797- 1806, 
1987, which disclose procedures for producing viral vectors and helper cells that 
minimize the chances for producing a viral vector that includes a replicating virus. 

Recombinant retroviruses suitable for carrying out the invention methods are 
5 produced employing well-known methods for producing retroviral virions. See, for 
example, U.S. Patent 4,650,764; Miller, supra 1990; Markowitz, et al, supra 1988; 
Watanabe, et al, supra 1983; Danos, et al., PNAS, S5:6460-6464, 1988; and Bossehnan, 
et al , Molecular and Cellular Biology, 2(5): 1 797-1 806, 1 987. 

For example, in one embodiment, a modular assembly retroviral vector (MARV) 
can be utilized to express the invention chimeric protein and an antibiotic resistance 
gene. A "covector" (referred to herein as MARSHA) can be utilized to provide a nucleic 
acid construct comprising the promoter, the regulatory element and exogenous nucleic 
acid, and a second antibiotic resistance gene. The MARSHA vector carrying exogenous 
nucleic acid also has LTRs modified to promote high-level expression only in the 
presence of the invention chimeric protein encoded by MARV and exogenous ligand 
therefor. Co-infected primary mammalian cells can then be selected using both 
antibiotics, resulting in a cell population that is dependent on ligand for high-level 
expression of the exogenous nucleic acid. 

By introducing all of the necessary regulatory machinery, plus exogenous 
nucleic acid, selectable markers, and nucleic acid encoding invention chimeric protein, 
e.g., into a MARV retrovims, highly efficient insertion of exogenous nucleic acids into 
targeted cells can be achieved. 

Thus, the above-described viral constructs address several important problems 
confronted in the use of retroviruses in application of therapeutic gene transfer strategies 
25 to a variety of human diseases. For example, the retroviral vectors of the invention are 
capable of prolonged gene expression under conditions where conventionally integrated 
retroviruses are no longer transcriptionally active. 

To illustrate the invention chimeric protein FDs, EcR was used as the 
steroid/thyroid hormone nuclear receptor and multiple examples using either RXR or 
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Usp as the dimer partner were constructed, with either the EcR or the dimer partner 
positioned at the amino terminus. The DNA binding, transactivation, and 
dimerization properties of these several FD variants were compared with the 
properties of native receptor complexes. The size of the ENU and ENR FDs prepared 
5 was in the range from about 1 35 to about 145 kD; whereas E alone had a size of 94 
kD, as shown by Westem blot analysis. 

EcR-Usp and EcR-RXR FDs efficiently bind EcREs 

The FDs were first examined for their ability to bind to target EcREs in 
response to ligand. FD proteins were extracted from transiently transfected human 
293 cells that were either xmtreated or treated with 1 |j,M murA for 40 hours. To 
eliminate the possibility that certain of the FD constructs were translated with greater 
efficiency than others, which would lead to a false appearance of higher fiinctional 
binding in comparative luciferase expression tests, p-galactosidase expression of 
internal control plasmids cotransfected with FD constructs was performed. These 
tests indicated no significant differences in the transfection efficiency of individual 
FD constructs, indicating that the DNA binding differences observed for different FDs 
reflect intermolecular properties of the FDs themselves, not expression level. 
Accordingly, all reactions were normalized to an intemal p-galactosidase control and 
total protein using the anti-EcR monoclonal antibody DDA2.7 (Koelle et aL, supra^ 
1991). 

FD constructs with either 0 or 5 linker segments were assayed for their ability 
to bind labeled EcREs, as shown in Figure 2B. A prominent band co-migrating with 
band shifts for the separate dimer complexes (E+U and E+R) was observed in many 
lanes and indicated that some of the FDs formed functional DNA-binding intemal 
25 dimers, referred to herein as "endodimers." For example, ROE, which is analogous to 
UOE, except for the substitution of U for R as the dimer partner, formed a clear 
endodimer band that was increased 4-fold by ligand, and demonstrated unequivocally 
a response to hormone. R5E, with a longer linker, had slightly decreased basal, and 
slightly increased ligand-stimulated, EcRE binding (5-fold) compared to that of ROE. 
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Constructs in which E is positioned at the N-terminus form endodimers and 
bind the EcRE probe an average of 10 times better than UNE constructs (Figure 2 A). 
In addition, ENU constructs bind probe 80-150% more readily than even those other 
FDs with high-level EcRE binding, such as E5R, but display nearly complete 
5 insensitivity to iigand for formation of DNA-binding complexes. This effect was 
foimd to be substantially independent of linker length. 

ENR FDs also demonstrate a greater affinity for the EcRE probe than do RNE 
constructs. However, unlike ENU constructs, ENR FDs have a high degree of 
dependence upon the presence of ligand for formation of endodimers. 

The observation that ENU and ENR FDs bound probe better than either the 
RNE construct or the UNE construct prompted examination of whether the large 220 
amino acid F domain of E (not found on R or U) accounted for this effect. Further 
EcRE probe binding studies performed utilizing several FDs having in-frame 
incremental deletions of EOR to Nhel, PvuII, Narl, and Bglll sites within the 
ecdysone receptor F domain (Figure 2B) showed that the deletions had a minimal 
effect on either response to ligand, or on binding to the EcRE probe. Only EOR- 
ABglll, in which the extreme C-terminal end of the hormone binding domain is 
removed, displayed significant loss of the shifted band. These results suggest that 
flexibility vdthin the long EcR F domain is not the primary determinant of improved 
DNA binding by FDs containing a EcR at the N terminal. This was presumed to be 
the result of a perturbed ligand binding pocket as opposed to decreased flexibility of 
the dimer partners joined by a linker. 

EcR-RXR and EcR-Usp FD Transactivation in Response to Ligand 

The results shown in Figure 2 clearly indicate that the EcR-RXR and EcR-Usp 
25 FD chimeric proteins could both respond to hormone and interact v/ith the response 
element for EcR; however, these tests provided no indication regarding the function 
of these proteins to transactivate responsive promoters and induce gene expression. 
To determine the ability of the FDs to transactivate responsive promoters and induce 
gene expression, the FDs and an EcRE-luciferase reporter plasmid were co- 
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transfected into 293 cells. The results of these studies revealed that the activity of the 
FDs w^as variably diminished compared to that of monomeric receptors. 

It has also been discovered that the invention ENU FDs are constitutive 
repressors of transcription of a gene under the control of the corresponding 
5 steroid/thyroid hormone nuclear receptor's response element. For example, UNE 
constructs did not efficiently bind target EcREs (Figure 2), and predictably did not 
have a dramatic influence on expression of the E4-luc reporter plasmid. ENUs, on the 
other hand, although appearing to readily bind EcREs, were also unable to stimulate 
luciferase expression. To further confirm whether ENU proteins could bind target 

10 response elements, but had lost the capacity to transactivate, the ability of EOU to 
competitively block ligand-stimulated luciferase expression by monomeric receptors 
was tested. EOU elicited a dose-dependent inhibition of VE with endogenous dimer 
partner, whereas UOE had virtually no inhibitory influence (Figure 5 A). At the lowest 
ratio tested, a 1:20 ratio of EOU to VE decreased stimulated expression of VE by 

15 20%, while equimolar EOU blocked 80% of the response to ligand. At any 

concentration tested, UOE exhibited no suppressive effect on VE-mediated activation, 
and in fact, appeared to increase the stimulated level of expression by about 5% to 
about 15%. EOU had a similar influence on E without VP 16 fusion (Figure 5B), and a 
lesser but measurable inhibitory effect on VE combined with separate exogenous Usp 

20 (Figure 5C). The 50% inhibition of the VE+U basal transactivation level may suggest 
that EOU binds the target EcRE about as well as complexes of the separate receptors. 
The plateau of the EOU suppressive effect at a 1 : 1 ratio with both E and E+U and the 
slightly increased stimulation at a 5:1 ratio of EOU to separate receptors indicate that, 
at higher concentrations, EOU can weakly transactivate in response to added ligand. 

25 In the experiments described herein utilizing FDs containing various 

combinations of DEcR and BEcR with either RXR or Usp, the length of the linker 
was not observed to have a significant effect on any of the functions of the FDs. 
Surprisingly, even shortening of the large F-domain of Drosophilia melanogaster EcR 
had little impact on ligand-responsive DNA binding of EcR FDs, Without restriction 

30 of the scope of the invention by any theoretical speculations, two possible 
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explanations of this observation are offered. One possibility is that there is enough 
flexibility within the structure of the individual receptors that any deformation 
necessary to allow^ appropriate dimerization can be tolerated while preserving, in 
some cases, nearly complete activity. The second possibility is that the C-terminus of 
5 the 5' receptor naturally lies close in spatial proximity to the N-terminus of the 3' 
receptor such that the distance is easily spanned. In the absence of detailed structural 
data of intact nuclear hormone receptor dimers, neither explanation can be ruled out. 

In the presence of ligand (i.e., MurA), RXR-containing FDs functioned 
similarly to monomeric ECR+RXR, but at approximately one-half of the maximum 

10 level of transactivation (for E5R). One likely explanation for this is that the FD 

constructs are expressed or translated less efficiently than the monomeric receptors as 
suggested by comparing the intensity of FD lanes to the E lane in a Westem blot 
analysis. Although the level of absolute transactivation was halved, the relative 
induction of transcription by the ENR constructs, in particular, exceeded the relative 

15 induction of monomeric EcR and RXR. 

To further address the partial or complete loss of transcriptional activation for 
FDs relative to the separate proteins, the powerful VP 16 transactivating domain was 
coupled to the amino-termini of FDs or separate EcR to determine if this addition 
could restore lost transactivational capacity. While a linker containing 5 linker 
20 segments (65 amino acids) was sufficient to allow good DNA binding, it may not 
have allowed sufficient freedom of movement for other domains, including those 
responsible for transactivation, to fold or orient as they do in the native proteins. 
Length of the linkers in VE5R and VE5U was therefore increased in increments of 5 
linker segments each to 10 and ultimately 20 linker segments (240 + 5 amino acids). 

25 Addition of the VP 16 activation domain to the amino terminus of the ENR 

FDs restored the full transactivation potential of the FDs relative to separate VP16- 
fused EcR, suggesting that addition of a strong transactivating domain overrides 
translational deficits caused by incorporating the receptor/dimer partner into a 
chimeric protein. However, VP16-ENU (VENU) chimeric proteins never exceeded 

30 30% of the absolute level of transactivation of separate VE+U. One possible 
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explanation is that some conformation constraint in the Usp half of the VENU 
constructs prevented the interaction of VENU with endogenous cofactors necessary 
for full high level expression. A second possibility is that increased spontaneous 
heterodimerization within ENU FDs results in a conformational alteration that 
5 decreases or blocks ligand binding, assuming that ligand plays a direct role in 
transactivation and not just dimerization. 

High level VENR transactivation suggests that the very low level expression 
observed from unliganded ENR constructs in Figure 3 was due to transcriptional 
repression by bound ENR proteins. The addition of VP 16 increased the basal VENR 
_ 10 FD expression over 5-fold in comparison to separate VECR+RXR. This presumably 

3 reflected an increased level of spontaneous dimer formation of ENR FDs resulting 

;|l from the forced proximity of the separate components by the linker. This 

M phenomenon may not have been readily evident in the gel shift experiments as the 

result of the short transient period of interaction of the proteins with the DNA probes 
15 (<30 minutes) compared to the duration with the responsive promoters in the transient 
M transfection experiments (>30 hours). 

Heterodimer formation and DNA binding of FDs 

Five different classes of interaction of protein units in the invention chimeric 
protein(s) (Figure 7) were predicted: 1) "disorganized", indicative of non-interaction 

20 by the individual receptor components; 2) "endodimer", indicating formation of 
functional dimers approximating the native heterodimer complex; 3) "trimer", 
indicating interaction with a separate monomeric partner, 4) "tetramer", indicative of 
two mutually cross-interacting chimeric proteins ; or 5) "oligomers", representing 
chimeric proteins chain-interacting with each other. The data presented herein 

25 provides clear evidence that disorganized, endodimer, and trimer species were formed 
when the invention chimeric proteins contained two protein units (FDs), with 
endodimer formation predominating. Only the UNE constructs appeared to be largely 
disorganized, as evidenced by their apparent inability even to bind DNA with high 
affinity. All remaining FD classes showed abundant evidence of endodimer 

30 formation, although RNE constructs were noticeably weaker than constructs with EcR 
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in the amino terminal position. The transient-transfection competition experiment 
(Figure 4) indirectly indicates that a high affinity monomeric dimer partner, such as 
VUsp, can displace a weaker intramolecular RXR dimer partner to form a partially 
functional trimer species; however, such a monomeric, high affinity dimer partner 
5 was much less capable of displacing an intramolecular Usp under the same 

circumstances. The lower affinity VRXR monomeric dimer partner was unable to 
displace to any significant degree either Usp or RXR as an intramolecular dimer 
partner in an invention FD fusion protein. 

Evidence of formation of higher order constructs, such as tetramers or 
10 oligomers, in gel shift assays is scant, but may be suggested in lanes of the gel shift 
assay with UNE and RNE constructs. Although band shifts were weak, UNE (and 
unliganded RNE) constructs had slightly intensified bands of higher molecular weight 
than the corresponding size of the endodimer bands. The faint but detectable high 
molecular weight shift bands observed for FDs in some lanes of the assays suggest 
15 tetramer and oligomer formation, presumably through cross interaction of the 
ecdysone receptor component of one FD with the dimer partner unit of another. 
These results, coupled with the results of competition experiments using 
superphysiological levels of competing dimer partner, suggest, in any event, that 
multimerization is relatively rare and is likely to occur, at even low levels, only with 
20 those FDs that have decreased capacity for endodimer formation. 

These data fiirther support the supposition that proximity to dimer partner (i.e., 
as in invention chimeric protein(s)) not only limits dimer partner preference, but also 
increases the ease of dimer formation and DNA binding for some of the fiision 
constructs relative to monomeric receptors. For example, ENU constructs displayed 

25 high-level complex formation with the EcRE probe (a 1 . 1 - and 0.9-fold increase for 
EU and E5U, respectively) even in the absence of ligand, while separate EcR and Usp 
required ligand for maximal complex formation. ENR constructs, on the other hand, 
still retained much of their original ligand dependence, indicating that dimer partner 
proximity is not the sole, or perhaps even most important, determinant of dimer 

30 formation. 
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The degree to which FDs interact with external receptors to form a trimer 
complex was indirectly examined in the studies showing their interaction with high 
levels of competing VP16"fusion dimer partners and the resulting effect on 
transactivation of the E4-luc reporter. In these studies, monomeric VRXR was unable 
to enter the FD complex of any construct, suggesting that the EcR component of FDs 
much prefers a linked dimer partner of either high or low affinity to a separate low 
affinity dimer partner. VUsp, on the other hand, had a comparatively smaller effect 
on transactivation induced by the E5U construct, than on E5R, indicating that the EcR 
protein in E5U preferentially dimerizes with the linked Usp, while the RXR dimer 
partner of E5R may enter a complex with monomeric VUsp. 

In summary, the results of the studies described herein indicate that selected 
chimeric proteins of steroid/thyroid hormone nuclear receptors with appropriate dimer 
partners can retain most of the primary characteristics of the native complex: binding 
of ligand, recognition and binding of cognate response elements, and, in some cases, 
ligand-stimulated transactivation of responsive promoters. Subsets of the constructs 
prepared displayed varying degrees of these characteristics. The R/UNE proteins 
characteristically exhibited low DNA binding and transactivation capacity while the 
ENR/U proteins uniformly demonstrated wild-type or superior EcRE binding and 
variable capacity to transactivate, resulting in properties ranging from constitutive 
repression to essentially wildtype, ligand-responsive transactivation. 

The invention herein provides the advantage that, for many studies in cultured 
cells or transgenic animals, invention FDs will allow the examination of specific 
heterodimer pairs with much decreased potential for contamination with exterior 
dimer partners, such as those endogenously produced in the test cell or animal. This 
25 may be of particular assistance in examining the function of specific RXR subtype 
combinations, or even for further studying the potential for Usp-Usp interactions. 

The invention herein provides the further advantage that specific heterodimer 
pairs can be examined by their introduction into the system as a single chimeric 
protein, e.g., as a fiision protein, rather than by separate introduction of two 
30 constructs. 
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In addition, studies described in the Examples herein indicate that many of the 
combinations may have unique properties of Ugand independence or repression that 
may have significance to their application for therapeutic purposes. For example, 
certain of the invention chimeric proteins that transactivate gene expression may be 
5 useful as a "gene switch," for modulating expression of an exogenous gene in a 

mammalian system or in plants, fungi and other non-mammalian species. When the 
FDs transactivate the response element-containing promoter (e.g., in the presence of 
ligand), the exogenous gene is switched on; when the FDs repress the promoter, the 
exogenous gene is switched off. 

In nature, dimers of nuclear hormone receptors are imstable and, hence, are 
not useful in x-ray crystallography studies to determine structure. Because of the 
demonstrated stability of the invention FD heterodimers, they may be advantageously 
used in the preparation of crystals for x-ray diffraction studies for use in rational 
design of ligands to develop new steroids, insecticides, steroid antagonists, and the 
like, as described hereinbelow. Crystal structure may also permit deduction of the 
structure of ligands for orphan receptors. It is also contemplated that such crystals of 
the invention FDs can be used for preparation of antibodies that react with the 
heterodimers using methods known in the art. 

In accordance with another embodiment of the present invention, there are 
provided isolated protein crystals suitable for x-ray diffraction analysis of a purified 
invention chimeric protein. In altemative embodiments, the crystal may be obtained 
of a ligand bound to a purified chimeric protein so as to form a chimeric protein- 
ligand complex, or a crystal may be obtained of a putative response element bound to 
purified fusion protein or fusion protein ligand complex as described herein. The 
invention additionally contemplates a set of x-ray diffraction crystal coordinates 
obtained by x-ray diffi"action of any such invention isolated protein crystals. 

A variety of methods are knowii in the art for purifying proteins and obtaining 
crystals of the pxirified proteins, for example growing crystals in microgravity and/or 
by vapor diffusion (D.R. Davies and D. M. Segal, Meth EnzymoL 22:266, 1971). 
30 Crystals of purified proteins can also be obtained commercially. To aid in the 
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purification of the invention fusion proteins, it is recommended to add a His tag to the 
amino terminus of the fusion protein, as is known in the art and described in Example 
6 herein. Addition of such a His tag does not interfere with dimerization of the 
invention fusion proteins. 

In accordance with still another embodiment of the present invention, there are 
provided methods for identifying potential ligand(s) for member(s) of the 
steroid/thyroid hormone nuclear receptor superfamily utilizing a set of atomic 
coordinates obtained by x-ray diffraction analysis of an invention purified protein 
crystal. The invention assay method comprises creating a three-dimensional structure 
of a chimeric protein formed into a functional entity (i.e., by dimerization of 
dimerization domains contained therein) as defined by the atomic coordinates 
obtained by x-ray diffiraction studies, employing the three-dimensional structure to 
design or select the potential ligand; synthesizing the potential ligand; and then 
contacting the potential ligand with an invention functional entity in the presence of a 
response element operatively linked to a marker protein imder conditions suitable for 
causing expression of the marker protein to determine the ability of the potential 
ligand to transactivate expression of the marker protein. The potential ligand can be 
designed de novo or designed from a ligand. 

Methods for obtaining a set of atomic coordinates of a protein crystal using x- 
ray diffraction and for creating a three-dimensional model of a protein from such a set 
of atomic coordinates are known in the art. Such procedures are disclosed, for 
example, in U. S. Patent No. 5,856,1 16, which is incorporated herein by reference in 
its entirety. For example, x-ray data sets can be collected on a R-axis IIC image plate 
system and/or on a 2. 2. A Synchrotron data set for refinement of the three-dimensional 
structure (i.e., the model). Then, the data can be collected at Cornell High Energy 
Synchrotron Source ("CHESS") on a charge-couple device and reduced to stmcture 
factor amplitudes using the Denzo Software Package (Denzo~An Oscillation Data 
Processing Program For Macro Molecular Crystallography, ©1993, Daniel Gewirth, 
Yale University). Oscillation photographs can be integrated and reduced to structure 
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factor amplitudes using software supplied by the manufacturer (Molecular Structures 
Corp., Dallas, Tex.). 

Refined heavy atom parameters can be used to compute multiple isomorphous 
replacement phases. Solvent flattening and phase extension (CCP4-Collaborative 
Computing Project No. 4, A Suite of Programs for Protein Crystallography; 
Daresbury Laboratory, Warrington, WA4 4 AD, U.K. (1979)) can be used to improve 
the map and allow identification of some of the residues in the protein core. Cycles of 
model building (Quanta, version 4.0b, Molecular Simulations Inc., Burlington Mass.), 
positional refinement, (Brunger, A. T., J, Acta Cryst., A46: 46-57, 1990); Brunger, A. 
T. et al., J. Acta Cryst,, A4fi:585-93, 1990) and phase combination (CCP4- 
CoUaborative Computing Project, supra) can be carried out until the switch to phases 
calculated from the model can be made. Refinement against -16 °C., 2.2. A data can be 
continued to allow the more difficult loop regions of the protein to be constructed. 

The invention will now be described in greater detail by reference to the 
following non-limiting examples. 

EXAMPLE 1 

Design of ecdysone receptor-Usp/RXR functional dimers 

Two classes of chimeric proteins were constructed as fusion proteins to study 
the activity of EcR-Usp/RXR functional dimers. In one class, EcR is at the N- 
terminus of a fusion protein, and in the other class the binding partner (either Usp or 
RXR) is at the N-terminus (Figure 1). To facilitate formation of the functional dimers 
and allow for insertion of polypeptide linkers between the receptor and its binding 
partner in the fusion protein, a 5 amino acid bridge that also encodes the restriction 
endonuclease site for Sfil was inserted between the two open reading frames (ORFs). 

A double stranded Sfil compatible oligonucleotide encoding the amino acid 
sequence GPGGGSGGGSGT (SEQ ID NO: 17) was designed to provide a high 
degree of predicted fiexibility while attempting to minimize repetitive sequence 
within the oligonucleotide. This nucleotide sequence incorporated the Sfil site at the 
5 ' end of the insert to allow for ease in increasing the number of linker segments 
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within a previously existing construct. By phosphorylating the 36-base-pair double 
stranded oligonucleotides and ligating them into Sfil digested FD plasmid templates, 
FDs were produced with peptide linkers of variable length that increased by 12 amino 
acid increments. 

5 Construction of fusion proteins containing the ecdysone receptor. 

Figure 1 shows the schematically fusion protein functional dimer constructs 
R/U(N)E and E(N)R/U. Construction of the invention fusion proteins began with 
modification of the N and C termini of human RXRa, dmUsp and dmEcR ORFs 
subcloned into the cloning vector SK-NBN (pBSK with a modified poly linker). An 
Sfil site was inserted at either end of each receptor, in-frame, by PCR mutagenesis. 

For the hRXR N-terminal Sfil site, the primer in the 5' direction was 
GTAGAATTCGGCCAACAGGGCCCATGGACACCAAACATTTC (SEQ ID 
NO: 1 8); and the primer in the 3 ' direction was GATGGGGGAGCTCAGGGTGC 
(SEQ ID NO:19). 

For the C-terminal Sfil site, the primer in the 5' direction was 
GGAGAGCTCGAGGCCTACTGCA (SEQ ID NO:20); and the primer in the 3' 
direction was ACCATCGATTCAGGGCCCTGTTGGCCCGTGCGGCGCCTC (SEQ 
IDNO:21). 

For the dmusp N terminal Sfil site, the primer in the 5' direction was 
20 GTAGAATTCGGCCAACAGGGCCCATGGACAACTGCGACCAG (SEQ ID 
NO:22); and the primer in the 3' direction was CAGCACGTGGACCATTGACA 
(SEQ ID NO:23). 

For the C-terminal Sfil site, the primer in the 5' direction was 
GGAGAGCTCTTTCTCGAGCAGCTG (SEQ ID NO:24); and the primer in the 3' 
25 direction was 

ACCATCGATTCAGGGCCCTGTTGGCCCCTCCAGTTTCATCGCCA GGCCG 
(SEQ ID NO:25). 



10 



15 




59 

For the ecdysone receptor N-terminal Sfil site, VP 16 sequences were fused in 
frame to the Ncol site approximately 200 base pairs into the ecdysone receptor ORF, 
creating an Sfil site at the VP16-ecdysone receptor boundary. 

For the VP 16 insertion site, the primer in the 5' direction was 
5 CATAAGCTTATGGGACAGACACTGATGGGACGGCCC (SEQ ID NO:26) and 
the primer in the 3' direction was 

CAGAGACCATGGGCCCTGTTGGCCCCCCACC (SEQ ID NO:27). 

For the ecdysone receptor C-terminus insertion site, the primer in the 5' 
direction was TTACCGCTAGCTCCACCA (SEQ ID NO:28); and the primer in the 
3' direction was GTAGATATCAGGGCCCTGTTGGCCCAGTCGTCGAGT (SEQ 
ID NO:29). All primer sequences are written 5' to 3'. 

For VP 16 (S.J. Triezenberg et al. Genes Dev., 2:718-729, 1988) fusion to 
RXR and Usp, the VP 16 sequence region was removed from VE using the Sfil site at 
the 3' boundary of VP 16 sequence for fusion of the 260 base pair VP 16 fragment into 
the N-terminal compatible Sfil site of previously modified RXR and Usp ORFs. All 
fusion receptor variants were originally produced by insertion of both ORFs at the 
central Sfil site. Linker segments with Sfil compatible overhangs were produced by 
annealing two linker-encoding oligonucleotides having the sequence 
GGGCCAGGAGGTGGCTCCGGGGGAGGTTCAGGCACA (SEQ ID NO:30) in 
the 5' direction, and the sequence 

GCCTGAACCTCCCCCGGAGCCACCTCCTGGCCCTGT (SEQ ID NO:31) in the 
3' direction. 

EcR F-domain deletion constmcts were produced by inserting an in-frame 
polylinker upstream of the Sfil-N-terminal modified RXR for reception of compatible 
25 F-domain deleted ecdysone receptor fragments. The polylinker in the 5' direction, 
AAGCTTGAGAGATCTGGGACGGCGCCCCCGGGGCTAGCGGGCCAACA 
(SEQ ID NO:32) encoded (from Bgl II) the peptide sequence IWDGAPGAS (SEQ ID 
NO:33) and restriction sites Hind III-Bgl II-Nar I-Sma I-Nhe I with an Sfil 
compatible 3' end. Hind III-Bgl II, Hind III-Nar I, etc. fragments of the ecdysone 
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receptor were inserted into this polylinker for fusion of F-domain deletions to RXR. 
Figure 2B shows schematically the F-domain deletion constructs of EOR made by this 
procedure. 

PGR reactions for production of receptor mutants were performed using 100 
ng plasmid template, 500 ng of each primer, and reaction conditions outlined by the 
manufacturer for Pwo (Boerhinger Mannheim) high-fidelity polymerase. A program 
of 1 min. 94°C/1 min. 45'*C/1 min. 72°C/1 min. for 20 cycles was used for production 
of all PGR products used. For constructs containing multiple repeat linker segments, 
fusion receptors were linearized by Sfil digest, and linker segment oligonucleotides, 
kinased to allow multiple tandem insertions, were ligated into the site by standard 
methods. Inserted linker segment repeats of between 0 and 5 linker segments were 
found by restriction endonuclease digest followed by sizing on 3% agarose gels. For 
the studies reported here, the minimum linker length contained only the 5 amino acid 
fusion bridge (signified herein by linker segment designation "0") and the maximum 
was 245 amino acids (including a five amino acid fusion bridge) (signified herein by 
linker segment designation "20"). 

Plasmids from clones of interest were prepared on a large scale for use in 
transfection and other analysis including confirmatory sequencing of constructs. All 
receptors were subcloned into vector LNGX (A.D. Miller, GenBank Acc. No. 
M28247) (with an extended polylinker) for use in transfection. 

EXAMPLE 2 
Transfection of FD constructs. 

For quantitative transactivation analysis, transfections were performed in 
triplicate in 24-well plates by calcium-phosphate co-precipitation with 100 ng of an 
individual receptor, the reporter plasmid E4-luc, and pGHl 10 (SV40-p-galactosidase) 
as an internal control. Briefly, the reporter plasmid, E4-luc, was constructed of 4- 
tandem EcREs inserted upstream of a thymidine kinase gene minimal promoter 
directing luciferase expression. EcRE oligonucleotides were as described by Thomas 
et aL, supra (1993) with BamHI/Bglll compatible ends. 1 |LiM ligand was added at the 
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time of transfection, and the cells were harvested for luciferase assay 40 hours later. 
Harvested cell extracts were split, and one part was analyzed in a luminometer for 
luciferase activity and the other part was analyzed for P-galactosidase activity using 
an orthonitrophenyl galactoside assay by standard methods. Luciferase levels were 
5 normalized to p-galactosidase values to correct for slight differences in transfection 
efficiency. 

Preparation of the invention fusion protein FDs for gel shift analysis by 
transient transfection into 293 cells was as follows: 300 ng of individual receptor 
plasmids and 100 ng pCHl 10 intemal control plasmid were cotransfected into 293 

10 cells at 60% density in Costar 6-well plates. One well of each group was treated with 
1 laM murA as ligand at the time of transfection. 40 hours later, extracts of transfected 
293 cells were made by scraping the cells from a well into a low volume of phosphate 
buffered saline, pelleting the cells, resuspending them in 200 |il 5X gel shift buffer 
(Yao et al, supra 1992), and sonicating with a Kontes cell disrupter for three 10-sec. 

15 bursts at output level 30. The extracts were then centrifiiged to clear the lysate and 
the protein was quantified. Extract volumes were adjusted vnth buffer to a 
concentration of 1 mg/ml and frozen at -70°C ixntil use. p-galactosidase activity was 
assayed as above to determine relative transfection efficiency of each well. 

EXAMPLE 3 

20 Gel mobility shift analysis. 

Comparative gel mobility shift analyses of FDs with control in vitro translated 
receptor complexes were performed using double stranded EcRE probes and labeled 
by Klenow fill using ^^P-dCTP and cold dGAT by standard methods. In vitro 
translated proteins used as controls were produced using the T3/T7 TNT (Promega) 

25 transcription/translation system follov^ng the manufacturer's protocol. In vitro 

translated proteins were qualitatively examined by 5% SDS-PAGE using protocols as 
described (J. Sambrook et al Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Press, Cold Spring Harbor, New York, 1989) to ensure the presence of full- 
length products of the proper molecular weight. Reaction conditions for protein- 

30 probe interaction and gel electrophoresis were essentially identical to those disclosed 
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in Yao et al, supra 1992. However, to improve comparison between samples, 
reaction mixtures (including dimer partners and probe) were prepared as a cocktail 
and distributed equally to individual tubes with receptor proteins, p-galactosidase 
assay indicated that all samples were essentially equivalent, so equal volumes (10 ]ul) 
5 of extract were used in each reaction in a final reaction volume of 30 \x\. The 
reactions were allowed to proceed at IVC for 5 minutes at which time ligand or 
vehicle was added and the reaction allowed to continue for 20 additional minutes. 
Band volumes were quantified using laser scanning densitometry. 

Western blot analysis 

6-well plates of 293 cells transfected with 2 |ig of receptor construct and 100 
ng of pCHl 10 were harvested in PBS and lysed by three rounds of fireezing and 
thawing, p-galactosidase assay of lysates indicated equivalent transfection efficiency 
for all constructs so equivalent protein (7.5 |ag) was loaded and run on a 12.5% SDS- 
PAGE gel and transferred to nitrocellulose by standard methods (Sambrook et al, 
supra 1989). The transferred filter was incubated with the anti-EcR monoclonal 
antibody DDA2.7 {Koelle et al, supra 1991) at a 1/1000 dilution at 4°C for 48 hours. 
After washes, anti-mouse IgG (1/5000) was added for 1 hour, washed away, and the 
blot processed for chemiluminescence and exposed to film. 

An autoradiogram of FDs and controls treated either with control vehicle or 
with 1 \xM of mur A as ligand was made with markers indicating endodimer FD or 
wild type receptor-binding complex band shifts. E+U and E+R were control lanes of 
in vitro translated proteins for sizing of endodimer band shifts. Figure 2A is a graph 
that quantifies endodimer-sized band volvmies obtained fi-om the autoradiogram. 

A prominent band co-migrating with band shifts for the separate dimer 
25 complexes (E+U and E+R) was observed in many lanes and indicated that some of the 
FDs formed fimctional DNA-binding internal dimers that we designate "endodimers." 
UOE displayed a barely detectable endodimer band that was perceptibly increased (2- 
fold) in intensity by the presence of ligand (Figure 2B). The addition of 5 linker 
segments in U5E appeared to amplify the overall intensity of the shift, but still 
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displayed minimal response to ligand. UNE constructs displayed faint bands above 
the weak endodimer band-shift that were equally as intense. These bands were 
occasionally visible but were proportionally less intense than the endodimer band 
shift of other FDs described below. ROE, which is analogous to UOE but with 
substitution of U for R, formed a clear endodimer band that was increased 4-fold by 
ligand (Figure 2B), and xmequivocally demonstrated FD responds to hormone. R5E, 
with a longer linker, had slightly decreased basal, and slightly increased ligand- 
stimulated, EcRE binding (5-fold) compared to ROE. The higher molecular weight 
shift bands observed in UNE lanes were not visible in ligand-treated R5E lanes 
(Figures 2A). 

ENU constructs indicated that FDs in which E was positioned at the N- 
terminus formed endodimers and bound the EcRE probe an average of 10 times better 
than UNE constructs. ENU constructs bound probe 80-150% more readily than even 
other FDs with high-level EcRE binding (i.e., E5R, discussed below), but displayed 
nearly complete insensitivity to ligand for formation of DNA-binding complexes. 
Like UNE FDs, the longer linker length of E5U did not significantly increase the 
binding to EcRE or responsiveness to murA, relative to EOU (Figures 2A and 2B). 
ENR constructs, like ENUs, also demonstrated a greater affinity for the probe than the 
reversed constructs (Figure 2A). Unlike ENU constructs, ENR FDs had a high degree 
of ligand dependence for endodimer formation. E5R displayed a slightly decreased 
shift from the rate of basal transcription, and a slightly elevated ligand-stimulated 
shift (1 1 -fold relative induction) in comparison to EOR (7-fold), much like ROE and 
R5E. Non-transfected cells, or cells transfected only with E, displayed no detectable 
shift even after prolonged exposure. At concentrations of protein higher than those 
used in these gel-shift reactions, a shift of E in combination with endogenous RXR 
was observed. Separate experiments also confirmed that FDs specifically boxmd 
EcREs and not unrelated control probes that included thyroid hormone response 
elements. 

Figure 3 shows relative luciferase expression of FD constructs with or without 
1 |iM murA. The results of these studies are summarized in Table 1 below and show 
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repression of monomeric receptors and monomeric dimer partners by EOU and UOE 
FDs. 

TABLE 1 





UOE 


ROE 


U5E 


R5E 


EOU 


EOR 


E5U 


E5R 


E 


E+U 


-Lig/E4 


0.75 


0.18 


0.52 


0.25 


0.17 


0.21 


0.41 


0.46 


1.08 


1.74 


+Lig/E4 


0.55- 


1.75 


0.69 


1.50 


0.40 


3.39 


0.62 


3.89 


7.72 


8.34 


Rel. 


0.7 


9.7 


2.7 


6.0 


2.4 


16.2 


1.6 


8.4 


7.1 


4.8 


Ind. 























-Lig/E4 = luciferase activity in cells transiently co-transfected with FDs and 
monomeric receptors vs. reporter only without ligand 

+Lig/E4 =luciferase activity in cells transiently co-transfected with FDs and 
monomeric receptors vs. reporter only with ligand 

Rel. Ind. = relative induction of individual receptor groups 

As shown in Table 1, UNE and ENU constructs either did not stimulate 
luciferase expression or appeared to actually function as repressors of basal 
transcription. EOU in the absence of ligand, for instance, reduced E4-luc transcription 
to only 17% of the basal expression level. RNE and ENR constructs, by comparison, 
functioned much more like separate E+R, although both the basal and induced levels 
were proportionally decreased relative to the monomeric receptors. As might be 
predicted from the mobility shift experiments, E5R provided the closest profile to the 
monomeric (i.e., wild type) separate receptors, having approximately 50% of the E+R 
induced expression level. By contrast, both EOR and E5R, however, had greater 
relative inductions than separate E+R (16.2- and 8.4-fold, respectively, versus 7.1- 
fold). 

EXAMPLE 4 

Addition of the potent VP 16 transactivation (x) domain to the N-terminus of 
FD constructs was used to further examine transactivation by FDs. To test the 
possibility that distortion of the endodimer by a short linker (0 linker segments) 
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contributed to inhibited transactivation, the number of Unker segments in the linker 
between the units in the invention fusion proteins was expanded to a maximum of 20 
Unker segments for ENR and ENU variants. As shown in Figure 4A, addition of a 
VP16 T domain to either the E monomer (VE) or to FDs (VENR or VENU) resulted 

5 in a 12 to 15-fold overall increase in luciferase expression. These VENR FDs 
containing linkers of variable length produce a stimulated level of transcription 
virtually identical to separate VE protein, suggesting that augmentation of ENR 
proteins with VP16 x domain compensated for any loss in transactivation resulting 
from the fusion of the receptor and dimer partner into the invention fusion proteins. 

10 Notably, however, the level of basal, induced expression was increased 7 to 8-fold 
over monomeric receptors. This resulted in a dramatic decrease in the relative fold 
induction, from 27.8-foid for separate VE to 4.1 to 5.4-fold for FDs (Figure 4A). In 
addition, the increase from a 5 amino acid linker to a 245 amino acid linker had little 
effect on either basal or activated VENR transactivation with the exception of a subtle 

15 decrease in both levels in the FDs having the longer 125 (10 linker segments) and 245 
amino acid (20 linker segment) linkers. As shown in Figure 4B, VENU constructs 
displayed neither the basal nor induced level of expression of separate VE+U, even 
though the overall level of transactivation was significantly increased relative to E+U 
complexes without a heterologous transactivating domain. Like the VENR proteins, 

20 the addition of increased linker segments to the VENU constructs had a minimal 
effect on basal or induced transcriptional activation. 

EXAMPLE 5 

To elucidate the propensity of fusion protein partners to dimerize with each 
other over other monomeric suitable dimer partners, the influence of monomeric 

25 dimer partners on FD transactivation properties was assayed. In the transient 

transfection experiments shown in Figure 6, R and U proteins with N-terminal VP 16 
fusions (VR and VU, respectively) were utilized to probe E5U and E5R promiscuity. 
E5R and E5U constructs were used because previous experiments suggested that they 
displayed properties that were the most similar to monomeric (i.e., wild type) 

30 receptors. When both E5R and E5U constructs were cotransfected into 293 cells in 
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equimolar quantities along with the E4-luc reporter, VR cotransfection was not 
observed to significantly influence either E5U or E5R function, even though VR was 
found to augment E-mediated transactivation alone by >5-fold either v^th or without 
ligand (Figure 6). VU, on the other hand, delectably interacted with E5R, and to a 
5 lesser extent with E5U. Ligand-dependent transactivation was observed for 

E5U+VUsp at approximately 10% of the ligand-stimulated level of VUsp with E, 
whereas E5R+VUsp activated to nearly 50% of the VUsp + E level. VR and VU 
alone had no influence on E4-luc expression either with or without murA. 

EXAMPLE 6 

10 Using a modification of the method for constructing invention fusion proteins 

described above in Example 1 , fusion proteins were constructed having a Bombyx 
ecdysone receptor (BEcR) in the amino terminal half of the fusion protein, a linker 
bridge of 5 amino acids, and either RXR (BEOR) or Usp (BEOU) as the dimer partner 
placed at the C-terminal half of the fusion protein. To facilitate cloning, the BEcR 

15 amino acid sequence in each of these fusion proteins was augmented at the C-terminal 
end with amino acids 650-878 from the Drosophila melanogaster ecdysone receptor. 
Similar constructs were made wherein a His tag was positioned at the amino terminus 
of the fusion protein to facilitate purification of the fusion protein (HisBEOR and 
HisBEOU, respectively). 

20 Gel mobility shift assays using tebufenozide and MurA as ligand were 

conducted as described in Example 3 above to determine whether functional dimers 
formed from the BEOR and BEOU fusion proteins. The results of these assays showed 
the both BEOR and BEOU dimerize and constitutively bind target DNA irrespective of 
the presence of ligand. When the study was repeated using HisDEOR, HisBEOR and 

25 HisBEOU, it was determined that the His tag does not effect the ability of these fusion 
proteins to bind target DNA. 

It will be apparent to those skilled in the art that various changes may be made 
in the invention without departing from the spirit and scope thereof, and therefore, the 
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invention encompasses embodiments in addition to those specifically disclosed in the 
specification. 
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